
An Introduction to 

Mechanical 

Engineering 


FREEYVftjf 

RTfrfXJPCff-il 


Michael Clifford (editor), Richard Brooks, 
Kwing-So Choi, Donald Biddings, Afan Howe, 
Thomas Hyde, Arthur Jones and Edward Williams 


HODDER 

EDUCATION 


An Introduction to 

Mechanical 

Engineering 

Michael Clifford (editor) 

Richard Brooks 
Kwing-So Choi 
Donald Giddings 
Alan Howe 
Thomas Hyde 
Arthur Jones 
Edward Williams 


University of Nottingham 


This page intentionally left blank 



An Introduction to 


Mechanical 

Engineering 


Michael Clifford (editor), Richard Brooks, 
Kwing-So Choi, Donald Giddings, Alan Howe, 
Thomas Hyde, Arthur Jones and Edward Williams 


h 


HODDER 

EDUCATION 


AN HACHETTE UK COMPANY 


First published in Great Britain in 2010 by 
Hodder Education, An Hachette UK Company, 

338 Euston Road, London NW1 3BH 

© 2010 Michael Clifford, Richard Brooks, Kwing-So Choi, Donald Giddings, Alan Howe, Thomas 
Hyde, Arthur Jones and Edward Wilhams 

Ah rights reserved. No part of this publication may be reproduced or 
transmitted in any form or by any means, electronically or mechanically, 
including photocopying, recording or any information storage or retrieval 
system, without either prior permission in writing from the publisher or a 
hcence permitting restricted copying. In the United Kingdom such licences 
are issued by the Copyright Licensing Agency: Saffron House, 6—10 Kirby Street, 

London EC1N 8TS. 

Hachette UK’s policy is to use papers that are natural, renewable and 
recyclable products and made from wood grown in sustainable forests. 

The logging and manufacturing processes are expected to conform to the 
environmental regulations of the country of origin. 

The advice and information in this book are believed to be true and 
accurate at the date of going to press, but neither the authors nor the publisher 
can accept any legal responsibility or liability for any errors or omissions. 

British Library Cataloguing in Publication Data 

A catalogue record for this book is available from the British Library 
Library of Congress Cataloging-in-Publication Data 

A catalog record for this book is available from the Library of Congress 


ISBN: 978 0 340 93996 3 
123456789 10 

Typeset in 10.5/12pt Bembo by Tech-Set Ltd, Gateshead 

Printed and bound in Italy for Hodder Education, an Hachette UK Company. 

What do you think about this book? Or any other Hodder Education tide? Please send your commen 
to educationenquiries@hodder.com 


v.hodderedu 


Contents 


Introduction 

vii 

[ Unit 1 - Fluid dynamics 

lJ 


1.1 Introduction 

1 

1 .2 Basic concept in fluid dynamics 

2 

1 .3 Boundary layers 

8 

1 .4 Drag on immersed bodies 

19 

1 .5 Flow through pipes and ducts 

25 

1 .6 Dimensional analysis in fluid dynamics 

34 

Unit 2 - Thermodynamics 

46 

2.1 Introduction 

46 

2.2 Air conditioning 

57 

2.3 Gas mixtures 

69 

2.4 Combustion 

74 

2.5 Reciprocating compressors 

90 

2.6 Heat transfer 

96 

2.7 Heat exchangers 

109 

2.8 Vapour power cycle 

119 

2.9 Reciprocating internal combustion engines 

131 

Unit 3 - Solid mechanics 

138 

3.1 Introduction 

138 

3.2 Combined loading 

139 

3.3 Yield criteria 

144 

3.4 Deflection of beams 

149 

3.5 Elastic-plastic deformations 

159 

3.6 Elastic instability 

168 

3.7 Shear stresses in beams 

184 

3.8 Thick cylinders 

195 

3.9 Asymmetrical bending 

207 

3.10 Strain energy 

217 





An Introduction to Mechanical Engineering: Part 2 


3.11 Fatigue 227 

3.12 Fracture mechanics 231 

3.13 Thermal stresses 239 


Unit 4 - Electromechanical drive systems 


4.1 Introduction 247 

4.2 Characteristics of loads 248 

4.3 Linear and rotary inertia 248 

4.4 Geared systems 250 

4.5 Tangentially driven loads 256 

4.6 Steady-state characteristics of loads 259 

4.7 Modifying steady-state characteristics of a load using a transmission 265 

4.8 Sources of mechanical power and their characteristics 266 

4.9 Direct current motors and their characteristics 268 

4.10 Rectified supplies for dc motors 285 

4.11 Inverter-fed induction motors and their characteristics 292 

4.1 2 Other sources of power: pneumatics and hydraulics 305 

4.13 Steady-state operating points and matching of loads to power sources 311 


Unit 5 - Feedback and control theory 


5.1 Introduction 317 

5.2 Feedback and the concept of control engineering 318 

5.3 Illustrations of modelling and block diagram concepts 321 

5.4 The s domain: a notation borrowed from mathematics 326 

5.5 Block diagrams and the s notation: the heater controller and tensioning system 331 

5.6 Working with transfer functions and the 5 domain 334 

5.7 Building a block diagram: part 1 337 

5.8 Building a block diagram: part 2 344 

5.9 Conversion of the block diagram to the transfer function of the system 348 

5.1 0 Handling block diagrams with overlapping control loops 350 

5.11 The control algorithm and proportional-integral-derivative (PID) control 352 

5.12 Response and stability of control systems 354 

5.1 3 A framework for mapping the response of control systems: the root locus method 365 


Unit 6 - Structural vibration 


6.1 Introduction 376 

6.2 Natural frequencies and mode shapes 377 

6.3 Response of damped single-degree-of-freedom systems 405 

6.4 Response of damped multi-degree-of-freedom systems 424 

6.5 Experimental modal analysis 431 

6.6 Approximate methods 437 

6.7 Vibration control techniques 446 

Questions 457 

Index 473 





Introduction 


Introduction 


7 think I should understand that better’, Alice said very politely, ‘if I had it mitten down: but I can’t quite 
follow it as you say it.’ 

(, Alice’s Adventures in Wonderland by Lewis Carroll) 

This book builds on the experience and knowledge gained from An Introduction to Mechanical 
Engineering Part 1 and is written for undergraduate engineers and those who teach them. These 
textbooks are not intended to be a replacement for traditional lectures, but like Alice, we see 
the benefit of having things written down. 

In this book, we introduce material to supplement the foundational units in Part 1 on solid 
mechanics, thermodynamics and fluid mechanics. In addition, the reader will encounter units 
on control, electromechanical drive systems and structural vibration. This material has been 
compiled from the authors’ experience of teaching undergraduate engineers, mosdy, but not 
exclusively, at the University of Nottingham. The knowledge contained within this textbook 
has been derived from lecture notes, research findings and personal experience from within the 
lecture theatre and tutorial sessions. 

The material in this book is supported by an accompanying website: www.hodderplus.co.uk/ 
mechanicalengineering, which includes worked solutions for exam-style questions, multiple- 
choice self-assessment and revision material. 

We gratefully acknowledge the support, encouragement and occasional urgent but gentle prod 
from Stephen Haider and Gemma Parsons at Hodder Education, without whom this book 
would still be a figment of our collective imaginations. 

Dedicated to past, present and future engineering students at the University of Nottingham. 

Mike Clifford, August 2010 
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Unit 1 


Fluid dynamics 


UNIT OVERVIEW 

■ Introduction 

■ Basic concept in fluid dynamics 

■ Boundary layers 

■ Drag on immersed bodies 

■ Flow through pipes and ducts 

■ Dimensonal analysis in fluid dynamics 


1.1 Introduction 

Fluid dynamics is the study of the dynamics of fluid flow. Here we learn how flows behave 
under different external forces and conditions. In a sense this is similar to rigid body dynamics 
in physics, where Newton s second law is used to describe the motion of rigid bodies. Here, 
we must apply Newtons second law to fluid flows in a different way since fluids do not 
behave exactly like rigid bodies. This will be discussed in Section 1.2, where basic equations to 
describe fluid motion are derived and explained. Some discussions on laminar and turbulent 
flows are also given there, paving the way for what will follow. 

The fluid that we deal with in this unit is a viscous fluid, so the velocity of fluid flow becomes 
zero at the solid surface. The consequence of this no-slip condition is that flow velocity 
changes from zero at the wall to the free-stream value sufficiently far away from the wall 
surface. This thin layer is called the boundary layer, an important concept in fluid dynamics, 
which explains how the fluid forces are generated. So, in Section 1.3, we learn the basic 
behaviour of boundary layers to be able to estimate the viscous drag acting on the solid surface. 

The boundary layers over solid bodies behave differently depending on their shape. For 
example, the drag force acting on sports cars is much less than that on pickup trucks, where the 
boundary layer is separated from the body surface of vehicle creating a strong flow disturbance. 
In Section 1.4 we study the streamlining strategy to reduce the drag force of immersed bodies. 
We also discuss how the drag of immersed bodies is affected by the Reynolds number as well as 
the wall roughness. 

Pipes and ducts are important engineering components used in many fluid systems. It is 
important, therefore, that the flow resistance can be correctly estimated for different type of 
ducts and pipes. In general there are two types of flow resistance. One is due to the friction drag, 
while the other relates to the loss of energy due to boundary layer separation. In Section 1.5, 
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we study a method of minimizing the flow resistance similar to the streamlining strategy for 
immersed bodies. The discussion of pipes and ducts is extended to non-circular shapes by 
introducing the concept of hydraulic diameter. 

The final section of this unit deals with the non-dimensional numbers of fluid dynamics. We 
are already familiar with the Reynolds number, but there are many other non-dimensional 
numbers in fluid dynamics. In Section 1.6, we learn how to identify relevant non-dimensional 
numbers for different types of fluid flow. We also study how these non-dimensional numbers 
are used to carry out model tests. Applications of the similarity principle to fluid machinery are 
given, emphasizing the importance of non-dimensional numbers in fluid dynamics. 


The main aim of fluid dynamics is to understand the dynamic behaviour of fluid flows. Since 
all fluids are continuous, we can determine the velocities and pressure of flows as a function of 
space and time. To achieve this, we require the governing equations to represent the fluid flows: 
the Navier-Stokes equations. 

For simplicity, we consider only two-dimensional, isothermal (no thermal input or output) 
and Newtonian (the shear stress is linearly proportional to the strain rate) flows with constant 
density and viscosity. Therefore, u, v and p (velocities and pressure in Cartesian coordinates) are 
function of x, y and t. If the fluid flows are steady, they are only functions of x and y. 

The Navier-Stokes equations can be derived by applying Newton’s second law of motion to 
fluid flow. By considering a small control volume dxdy with a unit depth, the fluid mass times 
acceleration is given in vector form by 


The x- and y-component forces, F x and F y , acting on the control volume are given by 


1.2 Basic concept in fluid dynamics 


Navier-Stokes equations 



(i.i) 


m-a x = P -dxdyjj = F x m- a y = p -dxdy— = F y 


(1.2, 1.3) 


Since, the total derivative (or material derivative) represents 


d_ = A dx _d_ fty _d_ = _d_ + _d_ 

dt dt dt dx dt dy dt U dx V dy 


_d_ 

V dy 


(1.4) 


Therefore, 



(15 16) Stress Ty acts on a surface 




whose normal is ^-direction 

j 


Here, we shall consider only those forces acting on the surfaces of the control 
volume, although a body force will be introduced later. Surface forces include 
hydrostatic pressure p, the normal stresses t xx , T yy and the shear stresses J xy , T yx . 


in i-direction 


Figure 1 .1 Stress tensor t,j. The first index / of the stress tensor t,j 
indicates the direction of the stress that is acting on the surface, 
whose normal direction is indicated by the second index j. 



2 
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y+dy- 

dy 


Control 

volume 


, DV . 

l^OT* 


x dx x+dx 

Figure 1 .2 The balance of surface forces in the x-direction 


From Figure 1.2, the total surface forces in the x-direction can be obtained as follows. 

[ (t** + - T xx dy\ + \ [j xy + dy)dx - T xy dx\ - \ (p + J^dx)dy - p dy] (1.7) 


dy 


dx 


Here, the forces acting on the surface at (x + dx) can be obtained from the forces on the surface ai 
x by using Taylor s expansion. 

In a similar way, the total surface force in the y-direction can be obtained from Figure 1.3 as 

dp 

_ r 


y 

"*-a dr \ Ivi^ 

dy 

v| 

Control 

volume 

W + ~dr dx 



1 1 

P fy 


0 4 

1 " 1 x 

c dx x+dx 


Figure 1 .3 The balance of surface forces in the y-direction 


Therefore, the total force acting on the surfaces of control volume can be given by 

dp dT yx 


dp dT xx dT xy \ 

■^ + _ d ^ + ~dir dv p y 


dy dx dy 


Jdxdy 


(1.9, 1.10) 


So far, we have considered only the surface forces. There are flows, such as water waves around a 
ship, where the gravity force plays an important role. Therefore, we should also consider such a body 
force acting on the control volume together with the surface forces. In such a situation, the vertical 
force F y in equation (1.10) must be replaced by 

/ dp dT yx dT yv 


( 1 . 11 ) 


Here, the additional force due to gravity is p g dx dy, which always ai 
(downwards). 


n negative y-direction 
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We finally obtained the following equations of fluid motion. 

(du du du\ _ dp Id T xx 

U dx V dy) dx \ dx 




dv 


I 


dy I 

dy) 


(1.12, 13) 


For the Newtonian flow, the stress is linearly proportional to the velocity gradient, we can write 
this relationship in a matrix form as follows. 


(1.14) 


By substituting this relationship into equation (1.12) and (1.13), we have the final form of the 
Navier— Stokes equations after dividing all terms by the fluid density p. 

du du du = _lfy ld 2 u d 2 u\ 

dt U dx V dy p dx V \dx 2 dy 2 ) 

(1.15, 16) 

dv , dv_ , fh _ _ 1_ , ( d 2 v , d 2 v \ _ 

dt U dx V dy~ p dy ^dx 2 dy 2 ) g 


where, v = pt/p is called the kinematic viscosity with dimension [L 2 T ']. 


Now, the Navier— Stokes equations consist of four parts, which are: 

(Inertia force) = (Pressure force) + (Viscous force) + (Gravity force) (1-17) 


In others words, the left-hand side of the Navier-Stokes equations represents the inertia force 
(i.e. mass times acceleration), while pressure force represented by the pressure gradient and 
viscous force led by the viscosity are on the right-hand side. The gravity force can usually be 
omitted from the equations unless the surface wave or the natural convection is involved. 


By examining the magnitude in each term of the Navier-Stokes equations, we can see that the 
left-hand side of the equations of is of the order of u 2 / L, while the right-hand side is of the 
order of vu/L~. Here, u represents the velocity scale (either u or v), L represents the length scale 
(x or y) and v is the kinematic viscosity. The ratio of these two will give a non-dimensional value 
called the Reynolds number: 

„ , , , uL inertia force 

Re (Reynolds number) = — oc — -? (1-18) 

x 1 v viscous force 






Re = 65 Re = 161 

Figure 1.4 Effect of the Reynolds number on the vortex shedding from a circular cylinder 

(H. Schlichting, 1968, Boundary Layer Theory, 6th edn, New York: McGraw Hill, reproduced with 

permission of the McGraw-Hill Companies) 
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Similarly, the ratio between the magnitude of the inertia term ( of the order of — j and that of 
the gravity term (of the order of^) is called the Froude number: 

F, (Froude number, = ^ CC W) 

Here, it is customary to take a square root of the ratio to define the Froude number. 

For example, a yacht with a length of L = 20 m travelling at u = 10 m/ s would have 
Re = 1.3 X 10 8 and Fr = 0.7 assuming that v = 1.5 X 10 _6 m 2 /s. 


Worked example 

Find the acceleration of a fluid particle with the velocity field given by 
V= ui + vj 

= (3 t)i+ (2xy)j 


Du = 
Dt 

Dy = 
Dt 

DV _ 
Dt 


du 
U dx 


Continuity equation 

The continuity equation can be derived in a similar way as we have done to obtain the 
Navier— Stokes equation. Here, however, we should consider the mass balance in the control 
volume instead of the force balance. 


y 



dy 

P"' 

Control 

volume 

p m + TF & 



t 

pv 

i 

) ; 

I ^ d ^ ^ d. ^ 


Figure 1.5 Mass balance within the control volume 


Mass flux into the control volume in the x-direction is given by 

[ d(p u) 1 d(pu) 

[(p«) ■ 4 + o^~ dx ' d y\ - (p«) • *i = 

while the mass flux into the control volume in the y-direction is 

L , , . d(pp) .1 a(pv) 

(pu) ■ dx + — r— dy dx\ ~ ( pv ) ■ dx = -r—- 


dxdy 




(1.20) 


( 1 . 21 ) 
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Here again, we have used Taylor’s expansion to evaluate the mass flux out from the surface at 
(x + dx) and (y + dy) from the mass flux into the surface at pc and y, respectively. 


Since there 


be 


changes in mass within the control volume: 


a(p u) 

dx 


■ dxdy + 


■ dxdy = 0 


( 1 . 22 ) 


Assuming that the fluid density p is constant, we have: 


du dv = Q 

dx dy u 


(1.23) 


This is called the continuity equation based on the conservation of mass. 


Worked example 

When the velocity field is given by 

V = (x 2 ~ y 2 )l + vj 

determine the f-velocity from the two-dimensional continuity equation. 


Substituting into 


u = x 2 -y 2 

du = 2x 
dx 

, . . ( du , dv A 

the continuity equation ^ = 0^ 


dv 

dy 


= -2x 


v = -2xy + C(x) 


get 


Worked example 

There is a two-dimensional steady flow whose velocity vector is given by 
V = 2 xyi ~ y 2 ] 

where i and j are unit vectors in the x- and y-directions, respectively. Obtain an 
expression for the pressure gradient of this flow if the fluid density is constant 
and the viscosity and gravity forces are negligible. 




u — 2xy v= -y 2 


du 

dx 


2 y 


du 

dy 


2x 


dv 

dy 


-2 y 
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Substituting these ii 


3 the Navier-Stokes equations, we get 

1 


(2*y)(2 y) + (~y 2 )(2x) = - 

0 2 _ <?£ _ 

2 * y p ax "ax 

(2xy)(0) + ( y 2 )( 2y) = 

3P = 

ay 


7 p ay 


p ax 

-2pxy2 

P dy 
-2py 3 


Laminar and turbulent flows 


The flow through a pipe remains smooth and steady below the 

critical Reynolds number, given by 

Re = — = 13,000 ( 1 . 24 ) 

However, the flow becomes fluctuating and random when the 
Reynolds number exceeds this value. Figure 1.6 shows this 
process where the transition from laminar to turbulent flow is 
demonstrated with dye injected into a pipe through a needle. 


o 


© 


We can see the change in flow behaviour as a function of the Fjgure A 6 The fk)w jn a pjpe chan g es from 

Reynolds number. Figure 1.7(a) shows a laminar flow at subcritical laminar flow (a) to turbulent flow (b) 

Reynolds number, where the dye filament stays straight through until 

the end of the pipe. As the Reynolds number is increased in Figures 1.7(b) and (c) the flow 

develops patterns, signifying the flow transition that is taking place. Finally, turbulent flow is 

reached in Figure 1.7(d) where the dye patterns seem to be random and chaotic. 


Although the Reynolds number is an important parameter affecting the transition process to 
turbulence, there are other influential factors such as the wall roughness, initial disturbance and 
external disturbance. For example, a sharp inlet to the pipe will disturb the flow, and therefore 
reduce the critical Reynolds number. Vibration of the experimental setup from the floor, as 
well as noise transmitted to the flow, will accelerate the process of transition. 






Laminar flow 

o 

Transitional flow 

o 


-'V' - ‘ ^ ~ * 

o 

Turbulent flow 

o 


Figure 1.7 Flow transition to turbulence in a pipe flow. Laminar flow (a), where the injected dye 
stays straight through the pipe; transitional flow (b), where the flow develops certain patterns; 
turbulent flow (c, d), where the flow becomes random and chaotic. 
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There is, however, no precise definition of turbulence. Indeed, it is very difficult to determine 
whether a particular flow is turbulent or not. For example, random waves that can be observed 
over the surface of the water in a swimming pool are probably not turbulence. Therefore, we 
have to look at the symptoms of the flow to determine whether it is turbulent or not. 

Symptoms of turbulence include: 

• irregularity 

• high diffusivity 

• high Reynolds number 

• three-dimensionality 

• dissipativeness 

• continuous flow. 

Every turbulent flow is different, yet they have many common characteristics as listed above. 
We must, therefore, check whether a particular flow satisfies all of these characteristics before 
declaring that it is a turbulent flow. 


Learning summary 

By the end of this section you should have learnt: 

y the Navier-Stokes equations are governing equations for fluid motion, which can be derived from 
Newton’s second law of motion; 

y the continuity equation guarantees the conservation of mass; 

y the Reynolds number indicates a relative importance of inertial force in flow motion to viscous force; 
y the Froude number signifies the importance of inertial force in flow motion against the gravity force; 

/ all flows become turbulent above the critical Reynolds number. 


1 .3 Boundary layers 


The boundary layer is a thin layer created over the surface of a body 
immersed in a fluid (Figure 1.8), where the viscosity plays a significant 
role. Due to the non-slip condition of viscous flows, the velocity at 
a solid surface is always zero. This means that the velocity gradually 
increases from zero at the wall to the freestream velocity U 0 at the 
edge of the boundary layer. This creates a thin, highly sheared region 
called the boundary layer, over a body surface in a moving fluid (or 
over a moving body in a still fluid). Over a large commercial aircraft, 
for example, the boundary layer a few millimetre thick near the 
cockpit can grow to as much as half a metre thick towards the end of 
the fuselage. 



Figure 1 .8 A boundary layer being developed 
over a wing 


Reynolds number 

Similar to pipe flows, the boundary layer over the surface of a body is initially laminar, but 
will soon become turbulent as the Reynolds number increases. Here, the Reynolds number to 
describe the state of the boundary layer flow can take any of the following forms. 


LLx U 0 8 U 0 8* Ufi 


(1.25) 


While the pipe diameter is always used to define the Reynolds number of pipe flows, the 
length scale of boundary layer flows takes either the streamwise length x along the body surface 
or one of the boundary layer thicknesses such as 8, 8* or 0. The boundary layer thickness 8 is 
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defined as the distance from the wall to the point where the boundary layer velocity reaches 
the freestream velocity. Discussions on other boundary layer thickness, such as 8* or 0, will be 
given later in this section. 


U Q> 







t ^0 ( | 

Transition ! 




Laminar Turbulent 


Figure 1 .9 Development of the boundary layer over a flat plate parallel to the flow. 

Note that the boundary layer thickness grows faster when it is turbulent. 

The boundary layer thickness over a flat plate at the streamwise length x from the leading edge 
can be obtained using the following formulae. 

8 = 5.0 x[~j 2 for laminar flow (Re* < 3 X 10 6 ) (1 .26) 

8 = 0.37 ^ j for turbulent flow (Re* > 3 X 10 6 ) (1 .27) 

The boundary layer growth depends on the flow condition, whether it is laminar or turbulent. 
The growth is much faster for turbulent boundary layers (Figure 1.9) as the diffusivity increases 
as a result of transition to turbulence (Section 1.1). The critical Reynolds number for the 
boundary layer over a flat plate is usually around 3 X 10 6 as indicated in equations (1.26) and 
(1.27). It may take quite a different value, however, depending on the initial as well as boundary 
conditions of the flow. For the boundary layer over a rough surface, for example, the critical 
Reynolds number is less than 10 6 .This means that the boundary layer transition takes place 
much earlier over a rough surface as compared to that over the smooth surface. 


Worked example 

A laminar boundary layer is being developed over a flat plate with the free- 
stream velocity of 1.5 m/s. Assuming that the pressure gradient along the plate 
is zero, obtain the distance x from the leading edge where the boundary layer 
thickness 8 becomes 10 mm. Using the critical Reynolds number R xc of 10 6 , 
determine the transition point where the boundary layer becomes turbulent. 
The kinematic viscosity and density of the fluid (air) are 1.5 X 10 -5 m 2 /s and 
1.2 kg/m 3 , respectively. 




The transition point a 


can be obtained from 
x t U 0 

R xc = = 10 6 

x ( =10^=10«X^><i0l! = 


10 [m] 


j 
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Reynolds number of living things 


Reynolds number of living things depends on their size, flight or swim speed and the medium 
they live in. The following are the Reynolds numbers of some of familiar living things. Unless 
the body size is very small or the flight or swim speed is very low, as with butterflies, the flow 
around the body of living things is most likely turbulent. 



Velocity profiles of boundary layers 


The Blasius profile is the theoretical velocity profile of the laminar 
boundary layer over a flat plate where the static pressure does not 
change along the plate. Here, the Blasius profile is independent of the 
Reynolds number of the boundary layer, as shown in Figure 1.10. 
The Blasius profile can be approximated by the parabolic velocity 
profile given by equation (1.28). 


U 0 l ( 2 


(1.28) 


For the turbulent boundary layer, however, there is no theoretical 
solution to represent the velocity profile. The turbulent boundary 
layer profile can be approximated by the one-seventh law given by 


U 0 


llV' 


(1.29) 


The atmospheric boundary layer is often simulated by the one- 
seventh law in a wind tunnel, where building or bridge models are 
tested. Indeed, this empirical law has a reasonable agreement 
with the actual profile of the turbulent boundary layer. However, 
the velocity gradient of the one-seventh law at the wall is always 
incorrect, since 


A 

dy 


= oo 


(1.30) 



Figure 1.10 Velocity profiles of laminar and 
turbulent boundary layers 
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Therefore, it cannot be used to investigate the turbulent boundary 
layer close to the wall surface. Here, we should use the logarithmic 
velocity profile instead (Figure 1.11). With the logarithmic velocity 
profile (the log law, for short) a large part of the turbulent boundary 
layer can be represented by 

4=5.751og 10 ^ + 5.5 (1.31) 

except for a very thin region near the wall (the viscous sublayer) 
where the velocity is given by the following linear profile. 

< 132 > 

Here, the friction velocity it* is given by it* = It should 

be noted that the turbulent velocity profile is dependent on the 
Reynolds number (Figure 1.10). 



Figure 1.11 Logarithmic velocity profile of the 
turbulent boundary layer 


Worked example 

An atmospheric boundary layer (8 = 80 m) with the free-stream velocity of 
U 0 = 10 m/s has the wall-shear stress of T,„ = 0.077 Pa. The kinematic viscosity 
and density of air are v = 1.5 X 10 -5 m 2 /s and p = 1.2kg/m 3 , respectively. 

(1) Estimate the thickness of the viscous sublayer. 

(2) Obtain the velocity at y = 1 m from the ground. 


(1) The thickness of the viscous sublayer is given by 
_ yu_ 


y + = v = 1 ° y = 10 f 

where, u = ^ = 0.25 m/s 


- = 0.6 X 10 _3 m = 0.6n 


(2) Using the logarithmic velocity profile, 

\ = 5.5 + 5.75 log 10 

« = «*(s.5 + 5.751og 10 ^) 

= 0.25(5.5 + 5.75 log 10 
= 7.4 m/s 

If we use the one-seventh law instead, we have 
„-l 0 (i)’. 5 , 3m/s 
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Friction velocity in turbulent boundary layers 

Logarithmic velocity profile of turbulent boundary layer is given by 
^=5.751og 10 -^+5.5 

u U *Y 

Where and — — are non-dimensional variables for the local velocity and the distance from 

wall, respectively. 

+ _ u _ \ LT ~ l 1 + - u *y _ \ ( LT~ l )L 

u ~ u ILT-'I y ~ v l L 2 T~ 1 

Using these notations, we can write the logarithmic profiles as 

« + = 5.751og 10 y + + 5.5 

Logarithmic velocity profile can represent a large part of the turbulent boundary layers and 
turbulent pipe flows except in the viscous sublayer (y + < 10), where we should use 

u w * y + + 

— or u = y 
u 2 

Since u = y in the viscous sublayer, we have 

du _ u 2 
dy~ v 

du u 2 M - u * 2 * 2 

^ = ^V=(i^) =pu =T “' 

We see that the velocity gradient within the viscous sublayer is constant. 


Effect of wall roughness 


The wall roughness can affect the laminar boundary layer by promoting an early transition to 
turbulence. For the turbulent boundary layer, the wall roughness enhances the fluid mixing to 
increase the near-wall velocity gradient, which leads to an increase in skin-friction drag. Here, 
it is important to know how rough the wall should be before it starts affecting the skin-friction 
drag of the turbulent boundary layer. 

The wall surface of the turbulent boundary layer is hydraulically smooth when s + = < 5. 


Here, s is the roughness height, 


is the friction velocity and v is the kinematic 


viscosity of the fluid. In other words, the wall surface is smooth as far as fluid dynamics of 
turbulent boundary layer is concerned if the Reynolds number s + , based on the roughness 
height e and the friction velocity u , is less than 5. This suggests that the skin-friction drag of 
the turbulent boundary layer will be increased only when s + > 5. Recalling that the thickness 
of viscous sublayer is given by y + = 10, we can say that the wall surface is hydraulically smooth 
if the roughness is completely submerged in the viscous sublayer. 


If we know the roughness height e, the Moody chart for pipe flows (and the analogous chart 
for the boundary layers) will give us the effects of roughness on the skin-friction drag in the 
turbulent boundary layer. Wall roughness should be considered always relative to the boundary 
layer thickness, so we must use the non-dimensional roughness height in studying its effect on 
the skin-friction drag. 
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Figure 1.12 Schematic view of the boundary layer. The wall surface is 
considered smooth (a); it should be considered rough (b) despite the 
identical physical size of roughness. 


Momentum integral equation 

It is possible to estimate the skin-friction drag of the boundary 
layer using the momentum integral equation. To derive the 
equation, we consider a balance of forces on a control volume 
within the boundary layer (Figure 1.13), which is being developed 
over a flat plate. Here, the line 1 and line 3 indicate the entry 
and exit of the control volume, while the streamline just above 
the boundary layer (line 2) and the flat plate (line 4) indicate the 
upper and bottom surface, respectively 

Since there is no flow across the streamline or the plate wall, the 
mass flow rate to the control volume through line 1 must be equal 
to the mass flow rate out of line 3. This gives 




Flat plate L 


Figure 1.13 Control volume used in the derivation 
the momentum integral equation 


(1.33) 


The only force acting on the control volume is the skin-friction drag over the flat plate since 
the pressure gradient over a flat plate is zero. Accordingly, the skin- friction drag must be equal 
to the change in the momentum flux within the control volume. Therefore, the skin-friction 
drag over the flat plate is given by 

D(x) = p U 2 h - p J o u 2 dy (1 .34) 

By substituting equation (1.33) into equation (1.34), we obtain 

l>8(x) p$(x) 

D(x) = pJ q U 0 udy-pJ o u 2 dy (1.35) 

= p I u(U 0 — u)dy 
Jo 


Introducing the momentum thickness 0, a measure of the momentum loss as a result of the 
boundary layer growth, which is is given by: 

“-Cul'-uft I 1 - 36 ' 

Equation (1.35) can be written as 

D{x) = pU 2 Q (1.37) 
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Therefore, the momentum thickness 0 represents the skin- friction drag D over a fiat plate. A 
differential form of this equation can be given by 

r w (x) = p Ulf x (1.38) 

if we note that 

D(x) = J t Jpc)dx (1.39) 

T w 

Using the skin-friction coefficient, Cf = , equation (1.38) can be written as 

2P U o 

C/ -2f (1.40) 

This is the Karman’s momentum integral equation, which is valid for both laminar and 
turbulent boundary layers as long as the pressure gradient is zero. The Karman’s momentum 
integral equation indicates that the local skin-friction coefficient is exactly twice the 
streamwise change of the momentum thickness. 

Another important parameter in boundary-layer theory is the displacement thickness 8 
defined as 

8 * = fo i 1 ~ ~uj dy ° r U B b* = jju g -u)dy (1.41) 

The displacement thickness is a measure of mass-flow deficit in the boundary layer due to the 
non-slip condition of viscous flows. With this concept we can treat the development of the 
boundary layer over a wall surface as if the on-coming flow were shifted by the amount of the 
displacement thickness 8 (Figure 1.14). Therefore, we can estimate the change in flow rate 
though a duct, for example, without considering the change in velocity profile. 



Figure 1.14 Simulated effect in the boundary layer using the 
displacement thickness concept 


The ratio of the displacement thickness 8* to the momentum thickness 0 is called the shape 
factor H 

H=j (1.42) 

This is a good indicator for the flow status, which can be used to check whether the flow is 
laminar or turbulent. Over a flat plate with zero pressure gradient, for example, the shape factor 
of the laminar boundary layer is H~ 2.6, while the shape factor of the turbulent boundary 
layer is H~ 1.4. For the boundary layer under transition, the shape factor takes a value between 
1.4 and 2.6. 
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The shape factor can also he used to find out if it is close to flow separation (to be discussed 
next in this section). Since the velocity profile will become tall and thin as the flow separation 
is approached, the shape factor of the boundary layer will increase in value whether it is 
laminar or turbulent. Usually this increase in the H value is quite rapid, giving warning that the 
boundary layer flow is about to detach from the wall. 


Worked example 

Obtain the displacement thickness, 
momentum thickness and the shape 
factor of the boundary layer when 
the velocity profile is given 

by jj- = j- (Figure 1.15). 



/ 

Changing the variables by setting T) = g, we have 

= &/ o (i - in)*i = is 

= 8 I 0 11(1 ~ ^ = 6 8 


I 


Formulae for the boundary layer development 


Laminar: 

8/x = 5.0 1 

8*/x = 1.721 

0/x = 0.664 f (U 0 x/v)~ 1/2 
C f = 0.664 

C D = 1.328 J (1.43) 


Turbulent: 

8/x =0.37 1 

8*/x = 0.046 
0 / x = 0.036 > 
C f = 0.058 
C D = 0.0725 J 


(U 0 x/v)- 1/5 

(1.44) 


The drag coefficient C D is a non-dimensional drag of a flat plate of length x as defined by 

C D = t , while the skin-friction coefficient Cy is a non-dimensional, local wall-share stress 
2PU 2 0 x 


IP U 2 . 
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Worked example 

Obtain the skin-friction drag and the corresponding drag coefficient up to the 
transition point of the boundary layer in the worked example on page 9. 

D=J o t w (x)dx 

= l 2 Uo' C f (x)dx 

= <>•664 

= 0.664 X^X 1.5 2 X ( . a f° x~' 2 dx 


= 0.00275 X 2/10 = 0.0179 [N/m] 

l .-. C„ = -j-^— = °' 0179 = 0.00133 

2 pU 0 L M x 1 5?. x 10 

^ Alternatively, we can use C D = 1 .328 Rx 2 to give C D = 0.00133, where R x = 10 6 . 
1 Therefore, D = C D ypL/ 2 L = 0.0180 [N/m] 

We could also compute the skin-friction drag from Karman’s momentum integral 
equation, 

0 = 0.664 xRx~ 2 = 0.664 X 10 X (10 6 )^ = 0.00664 [m] 

Therefore, 

D = pl/ 2 e = 1.2 X 1.5 2 X 0.00664 = 0.0179 [N/m] 

This gives the drag coefficient of C D = 0.00133. 


Boundary-layer equations 

Assuming that the boundary-layer thickness is small as compared to the streamwise length of 
development, we can derive a special form of the Navier— Stokes equations, called Prandtl’s 
boundary-layer equations. In order to do that, we must make a number of assumptions. 


(1) The length scale in the vertical (normal) direction of the boundary layer is much smaller 
than that of the longitudinal (streamwise) scale. In other words, 


Ay « Ax or f>>f- 


(2) The velocity scale in the vertical direction of the boundary layer : 
the longitudinal direction: 


much smaller than that it 


(3) The Reynolds number is very large, so that 
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Then, the Navier-Stokes equations (1.15) and (1.16) will take a very simple form 

du du = _ldp cPu 

dx dy p dx dy 2 


0 = - 


l dp 
P dy 


(1.45) 

(1.46) 


Equation (1.46) suggests that p = p(x), therefore the pressure is constant across the boundary 
layer. Since the pressure gradient of the boundary layer over a flat plate is always zero, i.e. 
dp/ dx = 0, this means that the pressure is constant everywhere in the boundary layer over a flat 
plate. 

The derivation of the boundary-layer equations can be done using the order of magnitude 
analysis. Here, we set 

x ~ L(x is of the same order of magnitude as the plate length L) 
y — 8 (y is of the same order of magnitude as the boundary layer thickness 8) 
p ~ pu 2 (p is of the same order of magnitude as the dynamic pressure pu 2 ) 


v ~ (8 /L) ■ u (using the continuity equation, we find that v is of the same order of magnitude as 
(8/L) ■ u) 

After replacing x, y,p, u and v with L, 8 and u, we find that the first viscous term in the 
Navier-Stokes equations is much smaller than the second viscous term. It should also be noted 
that all terms in the y-equation become zero, except for the pressure gradient term. 



Figure 1.16 Development of the boundary layer 
over a convex surface, showing a retardation 
that leads to a flow separation with reverse 
flow. The flow is from left to right. 


Effect of pressure gradient 

So far, we have studied only the boundary layers with zero pressure gradient. However, we 
can extend the boundary-layer theory to cover situations with non-zero pressure gradient. 
Figure 1.16 shows a typical development of the boundary layer over a curved surface, where a 
dramatic change in the velocity profile is taking place. Where the pressure gradient is negative, 
or favourable, i. e. dp/ dx < 0 (from point A to C in Figure 1.16), the lost energy of the 
boundary layer due to skin-friction drag can easily be replenished by the pressure force acting 
in the flow direction. Therefore, flow separation does not generally take place easily in such a 
pressure gradient condition. However, when the pressure gradient is positive, or adverse, 
dp/dx > 0 (from point C onward in Figure 1.16), the pressure force cannot easily replenish 
the lost momentum. This is because the pressure force acts against the boundary layer under 
an adverse pressure gradient. This leads to a separation of the boundary layer, or simply a flow 
separation, creating a region of flow reversal (Figure 1.16). 

The flow separation point is defined as a location where the wall-shear stress becomes zero, i.e. 


(t 


(1.47) 


Therefore, the velocity gradient at y = 0 (at the wall) also becomes zero. 
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Worked example 

A sports equipment company is required to design a new swimming 
costume for the next Olympic Games to help swimmers break the world 
record in the 200 m freestyle. Answer the following questions assuming that 
the typical swim speed is 2 m/ s. 

(1) Suggest how you might design a new swim cap. In doing so, you need to 
explain the design concept based on fluid mechanical principles. 

(2) A new swimming costume must also cover the entire arms and legs. How 
should the design concept for these parts of the costume differ from 
that of a cap? Again, your answer must be based on fluid mechanical 

8 principles. 

(3) What considerations should be given to the choice of fabric for the swim 
cap and the swimming costume? Your answer must be accompanied by 
clear and sound reasons. 


(1) The Reynolds number of the boundary layer 
R x = 


2.0 X 0,1 

1.0 x lcr 6 


a swim cap is estimated a; 
= 2.0 X 10 5 


which is a subcritical Reynolds number. Therefore, the flow is laminar. In this case, 
there are two possible strategies in designing the swim cap. 

(a) Maintain the laminar flow by making it as smooth as possible, since the skin- 
friction drag of laminar flows is much lower than that of turbulent flows. 

(b) Promote turbulent flow by tripping the boundary layer, which can reduce the 
pressure drag of the swimmers head by moving the separation point further 
downstream. 

(2) To increase the thrust, the drag on arms and legs must be increased by 

(a) making the surface of the swimming costume covering the arms and legs rough, 
thereby increasing the skin friction drag, or 

(b) making the surface smooth as possible, thereby allowing the laminar flow to 
separate early. 

(3) We must choose the fabric of the swimming costume carefully to achieve these 
objectives: a smooth fabric where we want laminar flow, and a rough fabric in certain 
parts of the costume to promote turbulence. 


Learning summary 

By the end of this section you should have learnt: 

/ viscous fluid does not slip at a solid wall surface. This is called the non-slip condition of flow motion; 

/ the boundary layer is a thin fluid layer near a solid wall surface, where the velocity is less than the 
freestream velocity; 

/ the momentum thickness signifies the loss of momentum in the boundary layer due to skin-friction 
drag; 

/ the displacement thickness is a measure of mass flow deficit in the boundary layer; 

/ the boundary layer equations are a simplified form of the Navier-Stokes equations; 

/ flow separation occurs over a curved surface when the static pressure increases in the flow direction. 
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1.4 Drag on immersed bodies 


Pressure drag 

While the friction drag D f - rjc results from the viscous action of fluids on the body surface, the 
pressure drag D pres comes from the static pressure distribution around the body, mainly due to 
boundary-layer separation. The total drag acting on immersed bodies in incompressible flows, 
therefore, consists of the friction drag and the pressure drag. We can write 

D tot (total drag) = D fric (friction drag) + D pres (pressure drag) (1 .48) 

The relative importance of D pres to D &ic depends on the body shape as well as the Reynolds 
number. When the immersed bodies are streamlined, the friction drag dominates the total drag. 
When the non-streamlined bodies (bluff bodies) are placed in a fluid flow, however, the total 
drag is dominated by the pressure drag, and the contribution of the friction drag is usually 
negligible. 


Drags can be expressed in terms of the non-dimensional drag coefficient, which is given by 


C D 


D 

1PV 2 A 


(1.49) 


where A (which must be specified in quoting C D values) is either the frontal area (for thick 
bodies, such as motor cars) or the plan- form area (for long bodies, such as aircraft wings). 


In terms of drag coefficient, equation (1 .49) can be written as: 


C Dtot - C DfHc 


+ C Dp r 


(1.50) 


Flow around a circular cylinder 

For a circular cylinder with radius a and length b, the pressure drag is given by 

Dpres = l^ab(p - Poo ) cos 0 d0 (1 .51 ) 


whose drag coefficient is given by 


f 2lT afe(p — Poo ) cos 0 d0 
C ° preS = i o ( 2ab)\pV 2 

1 f 2lr (P ~ Poo) fn 

= 7T — ; cos0 d0 

2 Jo I p F 2 


hi: 


where the pressure coefficient C p is given by 


C p 


P~ Poo 

ipv 2 


(1.53) 


It should be noted that the frontal area of circular cylinder 
(2 ab) is used to non-dimensionalize the drag to give the 
pressure drag coefficient. This equation suggests that C Dpress 
can be obtained by integrating the streamwise component 
of C p over the circular cylinder. The cylindrical coordinate 
system being used in the computation is shown in Figure 
1.17, where the angle 0 is measured clockwise from the 
frontal stagnation point. 



Figure 1.17 The coordinate system used for the integral 
of static pressure around a circular cylinder to give the 
pressure drag. Here, p is the static pressure over the 
cylinder surface and p t is the freestream pressure. 
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Figure 1.18 compares the distribution of pressure coefficient C p over a circular cylinder 
between the laminar flow and the turbulent flow. It should be noted that both C p curves 
are asymmetric with respect to 0 = 90°, indicating that the static pressure over the front of 
the circular cylinder is much higher than that in the rear. The integrated pressure difference 
between the front and rear surfaces gives the pressure drag acting on the circular cylinder. 


Figure 1.18 also shows a significant difference in the static 
pressure distribution between the turbulent flow and the 
laminar flow. While the static pressure for the laminar flow 
stays near the minimum value of C p = — 1 .0 in the rear of 
the circular cylinder, the turbulent flow recovers to a much 
greater value of C p = —0.4 after reaching the minimum 
value of C p = —2.1 at around 0 = 75°. This reflects a 
small Cq value of 0.3 for the turbulent flow as compared 
to Co =1.2 for the laminar flow (Figure 1.1 9). The main 
reason for the smaller Co value for turbulent flow is that 
the flow separation takes place much further downstream 
due to greater mixing capability of the turbulent flow. As a 
result, the wake region in the downstream of turbulent flow 
separation is narrower than for laminar flow (Figure 1.21). 

However, inviscid theory gives a symmetric C p curve 
(Figure 1.18), suggesting that the drag on a circular 
cylinder is zero for zero viscosity fluids. Certainly this is 
not a realistic assumption in calculating the drag force on 
immersed bodies. Indeed, inviscid theory cannot impose 
the non-slip condition on the wall, so there will be no 
boundary layer development or flow separation over the 
immersed bodies. 



0 

Figure 1.18 Non-dimensional pressure distribution over 
a circular cylinder, where the C p curve for the laminar 
and turbulent flow are compared with the solution of 
inviscid flow. 


Drag of bluff bodies 

As has been previously suggested, the drag coefficient C D of 
immersed bodies is a function of the Reynolds number. The drag 
coefficient is gradually reduced with an increase in the Reynolds 
number as seen in Figure 1.19. Once the Reynolds number 
reaches the critical value (Re ~ 5 X 10 5 for circular cylinders 
and spheres with smooth surface) the drag coefficient will drop 
suddenly. This is the transition point where the flow around the 
immersed bodies will become turbulent from laminar flow. 


1.5-, 




Figure 1.19 Drag coefficient of circular 
cylinder and sphere as a function of the 
Reynolds number, showing that the C D 
value reduces as the transition takes place. 
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Figure 1 .20 Drag coefficient of a circular cylinder, showing that the 
transition takes place at lower Reynolds number with an increase in 
the surface roughness. 
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If the wall surface is rough, the transition to turbulences over a circular cylinder takes place at 
lower Reynolds number (Figure 1.20), but the drag coefficient C D for the turbulent flow (after 
transition) is greater than that with a smooth surface. Figure 1.20 also shows that the critical 
Reynolds number reduces with an increase in the roughness ratio s/d. The flow over a sphere 
is qualitatively similar to that over a circular cylinder. 


Worked example 

Obtain the drag force on a baseball of 73 mm diameter at the critical Reynolds 
number assuming that the flow around the ball is turbulent. The density and 
kinematic viscosity of air are 1.2 kg/m 3 and 1.5 X 10 _5 m 2 /s, respectively. 

From Table 1.2, we find that C D ~ 0.2 for a sphere when the flow is turbulent, where the 
critical Reynolds number is Re = 3 X 10 5 . Therefore, the drag force on the ball can be 
obtained by 

D = C d ^U 2 A 

where, 

A = "4 d2 = f X (0-073) 2 1 0.0042 [m 2 ] 

U = Re^ = (3 X 10 5 ) X 1 ' 5 0 ^ 7 1 3 ° 5 = 61.6 [m/s] 

D = 0.2XyX (61.6) 2 X 0.0042 = 1.91 [N] 


The dimples on a golf ball can reduce the pressure drag by making the ball surface rough. This 
reduces the transition Reynolds number by artificially forcing (tripping) the boundary layer 
to turbulent flow at low Reynolds numbers. As a result, the wake becomes narrower as can be 
seen in Figure 1.21. Although the friction drag is increased in this case, the total drag of the golf 
ball is reduced. This is because the golf balls are bluff (non-streamlined) bodies, where D pres is 
much greater than D lric . 



Figure 1.21 Comparison of laminar separation with turbulent 
separation, showing that the separation point moves further 
downstream when the boundary layer becomes turbulent. This 
reduces the pressure drag by making the wake narrower. 
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Tables 1.1 and 1.2 give the drag coefficient of two-dimensional and three-dimensional bodies, 
respectively. It should be noted here that the drag coefficient of sharp-edged bodies, such as 
squares and cubes, is insensitive to the Reynolds number since the flow is always separated at 
the sharp edges. In other words, the C D value of sharp-edged bodies remains constant whether 
the flow is laminar or turbulent as long as the Reynolds number is greater than 10 4 . 


Shape Cq Shape Cq I 

Plate: — - | 

2.0 

Half-cyiinder: 

*a 

1.2 

Square cylinder: — - | 

2.1 


- D 

1.7 


1.6 

Equilateral triangle: 

-<] 

1.6 

Half tube: — >- ^ 

1.2 


-> 

2.0 


2.3 


Laminar 

Turbulent 

Elliptical cylinder: 1:1 

- 

o 

1.2 

0.3 

2:1 

— 

o 

0.6 

0.2 

4:1 

— - 

<r p 

0.35 

0.15 

8:1 


< => 

0.25 

0.1 


Table 1 .1 Drag coefficient of two-dimensional bodies at Re > 1 0 4 


60° cone: 
Disk: 

Cup: 


n 

0 

1 

c 


CD? 


0.90 

0.85 

0.87 


Laminar Turbulent 


0.47 

0.27 

0.25 


Table 1.2 Drag coefficient of three-dimensional bodies at Re > 10 4 
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Worked example 

The fork ball is a baseball pitch thrown like a straight ball but with little or 

no rotation, where the ball initially travels straight but falls sharply as it gets 

closer to the batter who is standing about 18 m away from the pitcher. 

(1) Draw a figure showing the drag coefficient of the baseball as a function of 
the Reynolds number by considering the baseball as a smooth sphere. 

(2) Obtain the drag force on a baseball with 73 mm diameter at the critical 
Reynolds number assuming that the flow around the ball is turbulent. The 
density and kinematic viscosity of air are 1.2kg/m 3 and 1.5 X 10 -5 m 2 /s, 
respectively. 

(3) By what percentage does the drag force change if the flow becomes 
laminar rather than turbulent? 

(4) Explain the behaviour of the fork ball as described above using the 
principles of fluid mechanics. 

(5) Discuss how the pitcher should adjust the delivery of the fork ball for it to 
remain effective if the ball surface becomes rough during a game. 


'-+Re=^- 
10 4 10 5 | 10 6 

Critical Re = 3 X 10 5 

Figure 1.22 

(2) C D ~ 0.2 for a sphere when the flow is turbulent, where 
_ n tuI 2 


- = 0.0042 m 2 and U = Re 7 = 61.6 m/s 


PA 


D = 0.2XyX (61.6) 2 X 0.0042 = 1.91 N 


5 0.5. Therefore, the drag will be 


(3) When the flow around the sphere is laminar, 
increased to 2.5 times (150% increase). 

(4) The fork ball is the result of reverse transition (from turbulent to laminar rather than 
usual laminar to turbulent route) of flow around the ball. During this transition, the 
ball will experience a 150% increase in drag, resulting in a sharp drop near the batter. 

(5) As shown in Figure 1.20 for a circular cylinder (similar for a sphere), the drag 

increase will be smaller when the surface is rough. Therefore, the amount of drop 
will be reduced as a result of ball roughness. The critical Reynolds number for flow 
transition will be lowered so that the pitcher must throw a fork ball with a lower 
initial speed for it to be effective. 
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I Worked example 

A man jumped from an airplane with a parachute of 7.3 m in diameter. 
Assuming that the total mass of the man and parachute is 80 kg, calculate the 
speed of descent when he reaches terminal velocity. 

The drag coefficient of parachute is C D = 1.2 regardless of the Reynolds number, as it is 
a “sharp-edged” body. 

The terminal velocity will be reached when the drag of the parachute is balanced by the 
weight of the parachute and the man, i.e. D = W. 

Here, 

D = C D \pAV 2 = 1.2 X (1.2/2) X (tt/4) X (7.3) 2 V 2 [N] 

W = Mg = 80 X 9.8 [N] 

t/2 80 X 9.8 0 , n . 2/ 7 , 

F 1.2X (1.2/2) X (tt/4) X (7.3) 2 26-0 [ m /s ^ 

^ to give V = 5.1 [m/s] 


Streamlining strategy 

An important strategy in reducing pressure drag D pres of immersed bodies is to streamline them, 
by shaping the bodies in such a way as to move the flow separation point further downstream. 
This will effectively reduce the width of wake (the area in the downstream of flow separation), 
leading to a reduction of the low pressure region in the rear of the immersed bodies. 

Figure 1.23 shows a procedure for streamlining a rectangular cylinder that has sharp corners. 
The drag can be easily reduced to nearly a half by rounding the front corners of a cylinder, 
which reduces the drag coefficient C D from 2.0 to 1.0. A more dramatic reduction in drag can 
be obtained by tapering the rear corners, resulting in a reduction of drag to nearly one seventh 
of its original. It is surprising to observe that a fully streamlined cylindrical body is equivalent 
in terms of the total drag with a circular cylinder one tenth of its width. This shows the 
effectiveness of streamlining strategy in reducing drag by tapering the rear of immersed bodies. 



Figure 1 .23 Reducing the drag coefficient through streamlining strategy. The drag coefficient of a rectangular cylinder (a) 
can be reduced by tapering the front (b) and rear (c) of the body. A circular cylinder, as shown in (d), has a diameter that is 
nearly one tenth of the width of the streamlined cylinder (c), yet both have the same drag. With its cylindrical wing struts, 
no wonder a biplane (e) cannot fly very fast. 
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The width of the wake region can be reduced if the flow separation is moved back towards the 
rear of the body. In practice, however, it is often difficult to do this as it may reduce the capacity 
(volume) of the vehicle. Instead, trucks can benefit much by attaching a deflector on top of the 
cab (Figure 1.24), which reduces a large separation region in front of the trailer, leading to 
a drag reduction of up to 20%. 



Figure 1.24 A deflector can reduce the pressure drag of a truck by 20%, by steering the 
streamlines away from the frontal surface of the trailer. If there is no trailer attached to the cab, 
however, there will be a large increase in drag. 


Learning summary 

By the end of this section you should have learnt: 

/ pressure drag is a result of the boundary layer separation, where the static pressure difference is 
created between the front and rear of the bodies; 

/ drag coefficient of immersed bodies is reduced with an increase in the Reynolds number when the 
flow is laminar; 

/ drag coefficient of immersed bodies is suddenly reduced at the critical Reynolds number when the 
flow becomes turbulent; 

/ surface roughness will reduce the critical Reynolds number of immersed bodies, thereby reducing 
their drag at lower Reynolds number; 

/ streamlining is an effective strategy for reducing drag, where the immersed bodies are rounded at 
the front and tapered at the rear. 


1.5 Flow through pipes and ducts 

Friction factor 

The friction factor/ of pipe flows, as defined by 

hj 

f=T^ ( 154 > 

d 2g 

is a non-dimensional form of the frictional head loss hr. The friction factor is a function of the 
Vd . e 

Reynolds number R d = as well as the relative surface roughness where V is the bulk 
velocity, d is the pipe diameter, v is the kinematic viscosity of fluid and s is the typical surface 
roughness height. 
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For laminar flows (R d < 2 X 10 3 ) it can be shown that the friction factor is the only function 
of the Reynolds number, where 


/ = 


64 

Rd 


(1.55) 


This is called the Darcy— Weisbach equation for laminar pipe flows. It should be noted that the 
surface roughness does not affect the friction factor for laminar pipe flows. 

For turbulent pipe flows, Colebrook gave the following formula for f covering a wide range of 
Reynolds number and surface roughness 


r- 2 - oi °# + tl) 

Although accurate in presenting the friction factor for both transitional and fully turbulent pipe 
flows (R d > 4 X 10 3 ), this formula is difficult to use in practice since the friction factor is not 
given in a closed form. In other words, iteration is required to obtain the friction factor from 
this equation for a given Reynolds number and roughness ratio. It is for this reason Moody has 
presented a chart where the friction factor can be easily read. This is called the Moody chart, 
where the friction factor is given as a function of the Reynolds number R d 
and the roughness ratio ^ (Figure 1.25). Following a curve of constant ^ value (as shown on 
the right-hand side of the chart) to meet a constant Reynolds number line, one can read off 
the friction factor on the left-hand side of the chart. Typical surface roughness e for pipes and 
ducts, from iron to concrete, can be found in Table 1.3. 



Figure 1.25 Moody chart (taken from F. M. White, 2008, Fluid Mechanics, New York: McGraw Hill 
and reproduced with permission of the McGraw-Hill Companies) 
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Material 

Condition 

s mm 

Uncertainty, % 

Steel 

Sheet metal, new 

0.05 

±60 


Stainless, new 

0.002 

±50 


Commercial, new 

0.046 

±30 

Iron 

Cast, new 

0.26 

±50 


Wrought, new 

0.046 

±20 


Galvanized, new 

0.15 

±40 

Brass 

Drawn, new 

0.002 

±50 

Plastic 

Drawn tubing 

0.0015 

±60 

Glass 

- 

Smooth 


Concrete 

Smoothed 

0.04 

±60 


Rough 

2.0 

±50 

Rubber 

Smoothed 

0.01 

±60 

Wood 

Stave 

0.5 

±40 


Table 1.3 Typical surface roughness in pipe and channel flows 


There are two different definitions for the friction factor. The Darcy friction factor is used 
throughout this textbook, which is defined by 

hr 

(1 

d 2g 

Meanwhile, the Fanning friction factor is given by 

/f 


n f 

d 2g 


which contains a factor of 4 in its definitions. Since the friction head is given by 

Ap 

h f = — 


A p = ~, 


(1.58) 


±4 L 

pg d w 

Therefore, / = * 2 =4-^ = 4 C f 

^v 2 

d 2g 2 

Therefore, the Darcy friction factor is related to the skin-friction coefficient through 

f=4C f (1.59) 

Using this relationship, the friction velocity u of the pipe flow can be given in terms of 
friction factor/ as 

f-ivf ° r |1 - 601 

If we use the Fanning friction factor, we have/; = (/instead. 
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Minor losses 

Whenever there are changes in velocity magnitude or direction in a pipe or duct system, there 
will be associated pressure drops, called minor losses. The minor losses are typically found at 

• the entrance to the pipe or exit 

• sudden expansion or contraction 

• bends 

• valves. 

The minor losses are caused by the internal flow separation as a result of changes in the 
magnitude or direction of the flow through the pipes or ducts. These are similar to the pressure 
reductions along the immersed bodies as a result of boundary layer separation. Although they 
are called minor losses, the pressure drops can be a significant part of the total pressure drop 
when the pipe or duct has a short straight section. 

The minor head loss h m in a duct or pipe system is expressed by 
V 2 

h m = K-ir- (1.61) 

2g 

where, K is the minor loss coefficient or the FC-factor. Therefore, the total head loss through the 
pipe system is given by summing the frictional head loss hj and all the minor losses, to give 

h t „t = h f +J2h m (1.62) 

The frictional head loss hj through a pipe or duct is given from equation (1.54) as 

i 1 - 63 ' 

Examples of flow causing minor losses are shown in Figure 1.26, where the flow direction 
changes through a circular and square bend. Both flows are separated at the bend, but the 
degree of flow separation and the turbulence being produced are very different. The turbulence 
is much greater in a flow through a square bend than through a circular bend. This explains 
why the minor loss is much greater for the flow around a square bend. Therefore, the use of 
square bends should be avoided in a pipe or duct system. 


o o 

Figure 1.26 Flow through a circular (a) and square (b) bends 

The minor loss coefficient for mitre bends including that of the square bend is given in 
Figure 1.27. There is no Reynolds number dependency on this value since the flow is always 
separated at the sharp corner. Figure 1.28 shows the minor loss coefficient of circular bends 
at Re = 10 6 as a function of the bend-radius-to-pipe-diameter ratio -j and the bend angle 0 b . 
Here, the minor loss coefficient depends on the Reynolds number, so a correction factor given 
in Figure 1 .29 should be applied to this value. 

Other examples for large minor loss in a pipe and duct system include sudden contraction and 
expansion, where the flow separation takes place at the junctions (insets in Figure 1.30). The 
FC-factors are a function of the rate of contraction or expansion. 
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Figure 1.28 The loss coefficient of circular bends 
at Re = 10 6 . For other Reynolds numbers this 
coefficient must be multiplied by the correction 
factor given in Figure 1 .29 




Reynolds number 


Figure 1 .29 The Reynolds number correction factor for Figure 1 .30 Minor losses for the flow through a sudden 

circular bend loss coefficient contraction or expansion as a function of ^ 


Figure 1.31 illustrates some commercial valve geometries. Typical X-factor for the gate and disk 
valves is X ~ 0.2 when they are fully open, while it is X ~ 4 for the globe valve. Valves are the 
main source of minor losses in a pipe system as one can see in Figure 1.32, showing typical 
values of the X-factor when valves are partially open. It is shown that the head loss in the pipe 
or duct system will he increased by more than 100 times when a gate valve is closed by 75%. 
We must he careful, therefore, in the selection and use of the valves in a pipe and duct system. 

There are further sources for minor losses in a pipe system. Figure 1.33 shows the entry losses 
for different entry geometry. As we would expect, the X-factor for the pipe entry is a function 
of the relative radius and length of the entry. Note that the X-factor is always unity for a sharp 
exit from the pipe. 
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Figure 1.31 Commercial valve geometries, (a) gate valve; (b) globe valve; (c) angle valve; (d) 
swing-check valve; (e) disk-type gate valve (F. M. White, 2008, Fluid Mechanics, New York: 
McGraw Hill. Reproduced with permission of The McGraw-Hill Companies) 



Figure 1 .32 Typical minor losses of valves when they 
are partially open (F. M. White, 2008, Fluid Mechanics, 
New York: McGraw Hill. Reproduced with permission 
of The McGraw-Hill Companies) 



Figure 1 .33 Entry losses to re-entrant inlets (a), 
rounded inlets (b) and bevelled inlets (c). Note that 
the exit losses are K = 1 for all exit shapes. (F. M. 
White, 2008, Fluid Mechanics, New York: McGraw 
Hill. Reproduced with permission of The McGraw-Hill 
Companies) 
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Hydraulic diameter 

When the pipes and ducts are not circular, we can use the hydraulic diameter D h in place for the 
diameter of the circular pipe to calculate pipe losses. The hydraulic diameter is defined by 

H 4 x cross-sectional areas „ g „ 

h wetted perimeter 

With this concept, we can obtain the friction factor of non-circular pipes and ducts using the 
Moody chart just as we have obtained the friction factor for a circular pipe from it. Here, the 

VD h e 

Reynolds number and the relative roughness can be defined by — ~ — and respectively, using 

the hydraulic diameter D h instead of d, while the frictional head loss is given by 

i 1 - 65 ' 

The hydraulic diameter of a circular pipe is its physical diameter; that is D h = d. 
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Secondary flows 

Secondary flows can be observed in non-circular pipes and 
ducts, where the fluid near the corners has a movement 
across the mean flow direction. The secondary flows are 
driven by the turbulent-shear stresses, which act towards the 
corners of non-circular ducts. As a result, the isovelocity 
contours are similar in shape to the non-circular pipe or duct 
cross sections. This is why the hydraulic diameter concept 
works well for turbulent pipe or duct flows. Figure 1.37 
shows the isovelocity contours (a) and secondary motions (b) 
across a triangular pipe section normal to the mean flow. 

Secondary flows in non-circular pipes and ducts are 
turbulent flow phenomena, so there are no secondary 
motions for laminar flows. There are other types of 
secondary flows, which are caused by the centrifugal 
force acting on the flow around the circular bend. These 
secondary flows can be observed even for circular pipes 
or laminar flows. 



o o 


Figure 1.37 Isovelocity contours (a) and secondary motions 
(b) within non-circular ducts. Note that the isovelocity 
contours are similar to the duct shape. This is caused by 
the turbulent shear stresses, setting up the flow motion 
(secondary flow) normal to the mean flow direction. 


Worked example 

A section of pipe is made of sheet steel, whose cross section has the shape of 

an equilateral triangle. 

(1) If one side of the triangle is h, obtain the expression for the hydraulic 
diameter of this pipe. 

(2) Air of density p = 1.2 kg/m 3 and viscosity |i = 1.8 X 10 _5 kg/ms at 
atmospheric pressure is supplied by a blower through 30 m pipe with the cross 
section of an equilateral triangle of side 22.5 mm. Obtain an expression for the 
frictional head loss in terms of the volumetric flow rate and the friction factor. 
You can assume that the pipe section is straight and is placed horizontally. 

(3) If the blower is rated at 746W of output power, obtain the maximum flow 
rate through the pipe. 

I (1) The wetted perimeter of a triangle of side h is 3 h, and its cross-sectional area is given 
by 
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(2) Since h = 22.5 X 10~ 3 m,A c =2.19 X 10” 4 
The friction head loss is given by 


= —h 2 
3 n 

n 2 and D h = 1.30 X l(T 2 ir 


= = f-LAAL = fr n 

J D h 2g J D h 2g D h 2 g J U 


30 X (2.19 X 10“ 4 )“ 2 


r/Q 2 


(1.30 X 10“ 2 ) X 2 X 9.8 J 
= 2.45 X 10 9 /Q 2 

(3) The power is given by 

P = 746 = p ghfQ = 1.2 X 9.8 X (2.45 X lO^/Q^Q 
/Q 3 = 2.59 X 10“ 8 

Here, the pipe roughness ratio is given by 


0.05 
' 13.0 


= 3.85 X 10“ 3 


The Reynolds number is 


■ 


The flow rate Q can be obtained by iteration through the following process. 

(1) Assume an appropriate flow rate Q. 

(2) Compute s/D h and Re using equations (ii) and (iii), respectively. 

(3) Look up/ against e/D h and Re using the Moody chart. 

(4) Check to see if/ and Q satisfy equation (i). 

(5) If not, change the flow rate Q and repeat the above until it converges. 
You should get Q = 0.0094 m 3 /s. 




Learning summary 

By the end of this section you should have learnt: 

/ the friction factor of a pipe flow is a function of the Reynolds number and the surface roughness 
ratio, which can be obtained from the Moody chart; 

/ whenever there are changes in velocity magnitude or direction in a pipe or duct system, there will 
be associated pressure drops, called minor losses; 

/ the total head loss through the pipe system is obtained by adding the frictional head loss and all 
the minor losses; 

/ when the pipes and ducts are not circular, we can use the hydraulic diameter D h to calculate the 
pipe losses; 

/ the secondary flows in non-circular pipes and ducts are driven by the turbulent-shear stresses 
which act towards the corners of non-circular ducts. 
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1.6 Dimensional analysis in fluid dynamics 

Non-dimensional numbers 

Non-dimensional numbers such as the Reynolds number, Re and the Froude number, Fr are 
important in understanding the characteristics of the flow as well as in comparing the flow 
behaviour with others. These non-dimensional quantities can be obtained as a ratio of two 
different physical quantities. For example, the Reynolds number represents the ratio of inertial 
force to viscous force, while the Froude number is given as a ratio of inertial force to gravity 
force. The definition of non-dimensional numbers appearing in many fluid flows and their 
physical ratios are summarized in Table 1.4. 


Parameter 

Definition 

Ratio 

Situation 

Cavitation number 
(Euler number) 

Ca -~^LF 

Pressure 

Inertia 

Cavitation 

Drag coefficient 

r - D 

Drag force 

Aerodynamics, 

° |p U 2 A 

Dynamic force 

hydrodynamics 

Froude number 

Fr = If 

g L 

Inertia; 

Gravity 

Free-surface flow 

Grashof number 

Qr _ PA7 ]jL 3 p 2 
P 2 

Buoyancy 

Viscosity 

Natural convection 

Lift coefficient 

r, - L 

Lift force 

Aerodynamics, 

yu 2 A 

Dynamic force 

hydrodynamics 

Mach number 

Ma = 7 

Flow speed 
Sound speed 

Compressible flow 

Pressure coefficient 


Static pressure 
Dynamic pressure 

Aerodynamics, 

hydrodynamics 

Prandtl number 


Dissipation 

Conduction 

Heat convection 

Reynolds number 

J3 

CD 

II 

Inertia 

Viscosity 

Viscous flow 

Roughness ratio 

L 

Wall roughness 
Body length 

Turbulent, rough walls 

Strouhal number 

St = TT 

Oscillation 
Mean speed 

Oscillating flow 

Weber number 

we-^ 

Inertia 

Surface tension 

Free-surface flow 


Table 1.4 Non-dimensional numbers in fluid dynamics 


The Strouhal number, St, is a non-dimensional frequency of flow oscillation, appearing in 
such a situation as Karman’s vortex street (Figure 1.38). As seen in Figure 1.38 the Strouhal 
number of vortex shedding from a circular cylinder is nearly constant (St ~ 0.2) over a large 
range of the Reynolds number. 
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Figure 1.38 Vortex shedding 
from a circular cylinder, (a) Flow 
visualization; (b) Strouhal number 
(non-dimensional frequency for 
vortex shedding) as a function of the 
Reynolds number. 


Q 0.4^ 



10 10 2 10 3 10 4 10 5 10 6 10 7 
Re 


Worked example 

A chimney of 2 m in diameter and 40 m high is subjected to 22.35 m/s storm 
winds, where the density and dynamic viscosity of air are 1.225 kg/m 3 and 
1.78 X 10 -5 Pa • s, respectively. 

(1) Obtain the Reynolds number of the flow around the chimney, assuming 
that the velocity of the storm winds is uniform along the chimney. Is the 
flow laminar or turbulent at this Reynolds number? 

(2) Estimate the frequency of the vortex shedding from the chimney. 


(1) The Reynolds number is given by 


Re 


Ud 22,35 X 2 

(x/p 1.78 X 10“ s /1.225 


= 3.08 X 10 6 


Therefore, the flow around the chimney is turbulent. 


(2) From Figure 1.38, we get the value for the Strouhal number as 
fd 

St = jj~ 0.25 


IJ 22 35 

/= Stj = 0.25 X = 2.8 Hz 


Therefore, 
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Mach number, Ma, is a ratio of the flow speed to the speed of sound, 
indicating the compressibility effect of the fluid. Figure 1.39 presents 
the pictures of a flying bullet visualized by the shadowgraph technique, 
showing the shock waves over the bullet body at a high Mach number. 

Weber number, We, indicates a relative magnitude of the inertial 
force to the surface tension. This non-dimensional number becomes 
important when there is significant effect of surface tension in the 
flow phenomena, such as the droplets and in capillary flows. Figure 
1 .40 shows a sequence of pictures to show the break-up of droplets, 
where the Weber number plays a significant role in determining its 
behaviour. 


Grashof number, Gr, is defined as a ratio of the buoyancy force to the viscous force. The 
Grashof number becomes important when a strong buoyancy effect is considered in a situation 
such as natural convection. Figure 1.41 shows the pattern of thermal convection between 
two concentric cylinders visualized by cigarette smoke, where the outer cylinder is cooled by 
14.5 K. This gives the Grashof number of 120000 based on the gap distance. 



Figure 1.40 The break-up of droplets is 
governed by the Weber number 



shockwave around a bullet 



Figure 1.41 The streamlines in convection between concentric 
cylinders, which is governed by the Grashof number 

Drag coefficient C D and lift coefficient C L are non-dimensional numbers used for lift- 
generating bodies, such as wings and propellers. They represent the drag force and the lift force, 
respectively, acting on the bodies, relative to the dynamic pressure of the free stream. Figure 
1.42 shows the lift coefficient (a) and drag coefficient (b) of the NACA 0012 airfoil. As the 
angle of attack is increased beyond the stall angle (around 15°), there will be a flow separation 
(c) to reduce the lift coefficient dramatically with an associated increase in the drag coefficient. 
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O R c = 2.88 x 10 6 


0.3mm diameter ballatini 
distributed fromT.E. to x*/ c 




Figure 1.42 Lift (a) and drag coefficient (b) of the NACA 0012 airfoil (c) 

Buckingham’s theorem 

The Buckingham tt theorem, or the tt theorem for short, is a systematic method for finding 
relevant non-dimensional quantities. It applies not only to the problems in fluid mechanics, but 
to many other areas of engineering and science. Buckingham s theorem consists of two parts. 
The first gives the number of non-dimensional quantities to account for a given flow problem, 
and the second determines each of the non-dimensional quantities involved. 

Part 1 

If there are n physical variables describing a physical process which contains m basic dimensions, 
where n> m. Then, this physical process can be described only with (n — m) independent 
non-dimensional quantities: 

TTi, TT2, ..., TT„_ m (1.66) 
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Here, the physical variables are those to indicate the characteristics of the physical quantity or 
status (e.g. velocity, pressure, density, etc.), while the basic dimensions are given by M (mass), L 
(length), T (time) or 0 (temperature). 

Part 2 

In order to find the non-dimensional quantities tt 1 ,tt 2 , . . . , tt „ _ m , we first pick m physical 
variables that do not form any of TTj themselves. Then, the product of these variables with one 
additional variable from ( n — m) remaining physical variables will give the non-dimensional 
quantities. 

For example, if there are five physical variables iq, v 2 , v 3 , V 4 and v 5 which contain three basic 
dimensions M, L and T, then there are two (5 — 3 = 2) non-dimensional quantities to describe 
this physical process (Part 1). Picking iq, v 2 and v 3 physical variables, the non-dimensional 
quantities tt 1 and tt 2 can be given by multiplying iq, v 2 and v 3 by v 4 and v 5 , respectively, in such 
a way that ttj and tt 2 will become non-dimensional (Part 2) by choosing the powers a, b, c, d, 
e,Jg and h appropriately. 

TTi = (v 1 ) a (v 2 ) i ’(v 3 ) c (v 4 ) <i 
= (vi) e {v 2 y(v 3 ¥(v 5 ) h 

It is strongly advisable to go through worked examples carefully to understand Buckingham’s 
theorem, where a detailed procedure to find the number of non-dimensional quantities as 
well as to determine their non-dimensional forms are given. To start the process, we need to 
understand the flow problem with a view to identifying relevant physical variables. Table 1.5 
lists the physical variables appearing in many flow problems together with their dimensions, 
which can help apply Buckinghams theorem. 


Quantity 

Symbol 

Dimensions 

Acceleration 

dV/dt 

LT- 2 

Angle 

e 

None 

Angular velocity 

<0 

T” 1 

Area 

A 

L 2 

Density 

P 

ML” 3 

Force 

F 

MLT -2 

Kinematic viscosity 

V 

L 2 T _1 

Length 

L 

L 

Mass flow 

m 

MT" 1 

Moment, torque 

M 

ML 2 T -2 

Power 

P 

ML 2 T- 3 

Pressure, stress 

P,<T 

ML _1 T _2 

Speed of sound 

a 

LT” 1 

Strain rate 

s 

T” 1 

Surface tension 

Y 

MT- 2 

Temperature 

T 

e 

Velocity 

V 

LT” 1 

Viscosity 

P 

ML-T- 1 

Volume 

V 

L 3 

Volume flow 

Q 

L 3 T _1 

Work, energy 

W, E 

ML2T- 2 


Table 1.5 Dimensions of physical variables used in fluid dynamics 
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Worked example 

Obtain the non-dimensional quantities for a pipe flow by considering the 
following physical variables: 

Q[L 3 T _1 ] a [L] l [L] A p [ML -1 T“ 2 ] p, [ML -1 T _1 ] 



m 

Flow rate Q 

Figure 1.43 

There are five (n = 5) physical variables Q, a, l, A p and |x, and three (m = 3) basic 
dimensions M, L and T. Therefore, the number of non-dimensional quantities in this flow 
problem will be two (n — m = 2). 

In order to find these two non-dimensional quantities Tti and TT 2 , we firstly pick three 
(m = 3) physical variables Q, l and A p. It is guaranteed that the products of these physical 
variables do not form any non-dimensional quantities, since only A p contains the basic 
dimension M. 

Therefore, we set at, = Q 11 • l b . A p c • p/' tt 2 = Q'' ■ P ■ Ap ? ■ a h 

We now have to determine a, b, c, d,e,fg and h to make and Tt 2 non-dimensional. 
Substituting the basic dimensions into above, 

Tt! = [V a ■ T - fl ] ■ [L b \ ■ [M c L“ c T- 2c ] ■ [M d L~ d T- d ] 
tt 2 = [L 3e ■ T-t] ■ [Lf] ■ [M-? LST- 2 e] \L h ] 

In order for these quantities to be non-dimensional, we must solve the following: 

L:3a + b-c-d = 0 3e+f-g + h = 0 
T: —a — 2c — d = 0 - e - 2g = 0 

M:c + d = 0 g= 0 

The solution is given by 

g = 0,e = 0 ,/= - h,c= - d,a = d,b = -3d 
Therefore, the non-dimensional quantities can be given by 

tti = [Q-r 3 - Ap -1 -^ tt 2 =[l-a~ i y 

where d and / are arbitrary constants. We cannot determine these constants with the 
number of available equations. We set d =f= 1 for simplicity to get 
_ -Qjt _ 1 

771 “ ApP 772 “ * 

In other words, Tt! is a non-dimensional flow rate and tt 2 is the length to diameter ratio. 
The pipe flow problem can be described by only two non-dimensional numbers, so that 


TTl =/«(TT 2) 
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Dimensional analysis and model testing 

To carry out scaled model tests, it is important that we should follow the similarity principle. 
This basically consists of the geometric similarity and the dynamic similarity. First it is vital that 
the geometric similarity is satisfied between the test model and the prototype. Only after that 
should we proceed with the scaled model test, ensuring that the flow around the prototype and 
that around a test model are dynamically similar. 

To satisfy the geometric similarity, we must ensure that all body dimensions have the same 
linear scale in all directions. In other words, corresponding points of the prototype and the 
model must be related by the same linear scale ratio. For example, for a l/10th scale wind- 
tunnel test of a wing (Figure 1.44) the model must have 

• thickness, width and length that are l/10th those of the prototype 

• nose radius that is l/10th of the prototype 

• 1/1 Oth of the surface roughness as compared to that of the prototype 

• l/10th the thickness of coating, if that is either painted or lacquered. 

In addition, both the prototype and the model must have the same relative posture to the 
flow direction. This ensures that they have the same angle of attack. The dynamic similarity is 
guaranteed by making sure that all relevant non-dimensional numbers are the same during the 
scaled model tests. 




o o 

Figure 1.44 Prototype wing (a) and the 1/10th scale model (b) which are 
geometrically similar 


When we carry out a wind tunnel test of an aircraft, for example, we need an exact scaled 
replica of the prototype aircraft in order to fulfil the geometric similarity. Dynamic similarity 
in the test will be satisfied by carrying out the model test at the same Reynolds number 
and Mach number as in the prototype flight. If the Mach number is small, and therefore the 
compressibility effect is negligible, then the only relevant non-dimensional number to consider 
will be the Reynolds number to satisfy the dynamic similarity. 


Figure 1.45 Wind tunnel test of the 
X-48 blended wing body at NASA 
Langley Research Center 
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Worked example 

Derive the flow speed at which a scaled model experiment should be carried 
out, when the only relevant non-dimensional quantity is the Reynolds number. 


If the Reynolds number Re is the only n 
considered, we must ensure that 


n-dimensional quantity for the flow being 


V P L P_ 


So that, we have 


st be determined by the scale of the 


In order words, the velocity V m in the model tt 
model and the kinematic viscosity ratio. 

For example, if the model size is a tenth of the prototype in the same working fluid 
(v p = vj, the test speed V rn must be 10 times faster than the prototype speed V p in order 
to ensure the dynamic similarity. This is not always easy to achieve. There are special 
experimental facilities to attain high Reynolds numbers through a change in fluid density 
or viscosity. 


Worked example 

A test for high speed boat will be carried out in a towing tank. Here, we 
must consider the Froude number as the second non-dimensional quantity in 
addition to the Reynolds number. What are the conditions at which we should 
carry out the tank test to ensure the dynamics similarity? 


We must have the following relationships in order to ensure the dynamic similarity: 
Re fJ = Re m and Fr p = Fr m 


Therefore, 


V p v m 

-T= = Fr = -=^= 

y]i L P \!i L m 


The only way to satisfy these two conditions is 



K, = ;K Yu _ Ynhn = f 

Vp 'K V P V p L p \L p ) 


In practice, the first condition can be met relatively easily, but the second condition is 
difficult to realize. 
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Similarity rules in turbomachinery 

Turbomachinery can be classified into two types - pumps and turbines. Pumps are used to 
carry out useful work, such as to circulate hot water in a central heating system, by adding 
energy to the fluid through the rotation of impellors. Turbines, on the other hand, extract 
energy from the moving fluid through shaft work as the blades or impellors rotate. In 
this section, we describe the performance of centrifugal pumps (Figure 1.46) through an 
application of similarity rules. 



Figure 1.46 Centrifugal pump, where the flow moves in (1) and exits from (2). 


If we apply the Bernoulli equation at the inlet (1) and outlet (2) of the pump, we have 


Therefore, 


\P<? 2 <? 


I ft? 2? 


— + V + z) = h s ~ h f = H 


(1.67) 


Typically inlet and outlet areas of centrifugal pumps are similar (A x ~ A 2 ), therefore V\ ~ V 2 . 
Also, their locations are usually very close to each other (z^ ~ z 2 ). Therefore, 


= Pi ~ Pi 
Rg RJ? 


(1.68) 


This is called the net pump head. 

The power delivered to the fluid is called the water horsepower, given by 

K = P gQH 

while the power required to drive the pump is given by 
bhp = mT 

This is called the brake horsepower. The efficiency of the pump is defined as a 
water horsepower to the brake horsepower as 

P w PSQH 
71 bhp toT 


(1.69) 

(1.70) 

of the 


(1.71) 
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Worked example 

Find out non-dimensional quantities describing 
the performance of a centrifugal pump, where 
we can assume that there are four (n = 4) physical 
quantities given by 


pump head gH 
flow discharge Q 
impeller diameter D 
shaft speed n 


[L 2 T“ 2 ] 

[L 3 T -1 ] 

[L] 

[T- 1 ] 



There are only two basic dimensions (m = 2) in this problem. From the tt theorem, 
therefore, there must be only two (n — m = 2) non-dimensional quantities describing the 
pump performance. 

In order to find these two quantities TTj and tt 2 , we first pick two physical quantities D 
and n. We can easily see that these two cannot form a non-dimensional quantity. 

at x =D“-n b -Q c tt 2 = D d -n e -(gH)f 

Substituting the basic dimensions into these, we have 

Tt! = [L“] ■ IT-*] ■ [L 3c ■ T -c ] tt 2 = \L*] ■ [T~<] ■ [L 2 /-T- 2 /] 

wD 3 gH 

After some calculation, we find that TT} = q and tt 2 = n 2£)2 - Since the Tt theorem 
cannot give a functional form of a non-dimensional quantity, we take the liberty of 
making an assumption on its form. Conventionally, we use ^^3 rather then H q - 
Therefore, we set 


1 


Q 

' wD 3 


The performance of the centrifugal pump can be expressed as tt 2 = f„ (tt/): 
n 2 D 2 J»{nDi) 


where, C H and Co 


C H =/„(C Q ) 

e the head coefficient and the capacity coefficient, respectively. 
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Centrifugal pumps 


In general, pump performance is dictated not only by the flow discharge, impellor diameter and 
shaft speed, but also by the fluid density and viscosity as well as the surface roughness of the 
impellor. We should, therefore, write the pump head gH as a function of all of these variables. 

gH =fi(Q, A n, p, p,, s) 

from which we can find 

Ch = gi (Cq, R e , q) 

using Buckinghams theorem. Likewise, the brake horse power bhp has the following functional 
relationship: 

bhp =f 2 ( Q,D , n, p, p, s) 

from which we can find 


Here, 


C P — £2(Cq, R a S/ D) 


Capacity coefficient . . . Cq = — — ^ 
gH 
n 2 rP- 
bhp 

“ prr’Lfi 

p nD 2 


Head coefficient . . . C H 
Power coefficient ... C P = - 
Reynolds number . . . Re = 
Roughness ratio . . . 




(1.72) 

(1.73) 

(1.74) 

(1.75) 

(1.76) 


And the efficiency of the centrifugal pump T) is given by 


_ P gHQ 
71 bhp 


C H ' Cq 


- &(C q) 


If we assume that the effects of R e and s/D are small (i.e. the Reynolds 
number is sufficiently large and the internal pump surface is 
hydrodynamically smooth), we have 

Ch = gi (Cq), C P = g 2 (Cq) 

which are exactly the same as in the previous worked example. 

Unique performance curves exist for a family of pumps with similar 
geometries when the pump performance is plotted in terms of 
non-dimensional quantities such as Cq, C h and C P (Figure 1.48). 
These save us much effort and time by using only one set of 
non-dimensional curves to describe the pump performance of an 
entire family. 

For example, when the pumps from the same family are compared at 
the homologous points, such as at the best efficiency point (BEP), 

Cq, C h and C P are equal. Therefore, 



02 = '_h(D2\ 3 ^2 = IH2\ 2 (P2\ 2 

Qi «i\A/ A UJ \A/ 


El 

Pi 


P2 

Pi 



111 = B2 
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Cq 

Figure 1.48 Performance of centrifugal pumps 
in non-dimensional quantities (F. M. White, 

2008, Fluid Mechanics, New York: McGraw Hill. 
Reproduced with permission of The McGraw-Hill 
Companies) 
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Specific speed 


It is useful to know the type of pumps at the early stage of design 
when only the required flow discharge and pump head are known 
together with a likely shaft speed. For this purpose, we define the 
non-dimensional specific speed Ns’ as 


Ns' = 


C H 4 


k-Q * 1 


(1.77) 


where, Cq = 


Q* 

(iD 3 


and 


Ch — 


n 2 D 2 


a 


% s- a 


1.0 — | Centrifugal Mixed Propeller 

0.9- 



Mixed Axial 
flow flow 


where, ( ) indicate the values at BEP (best-efficiency point). Pump designers Figure 1.49 Non-dimensional specific 
can identify the type of required pumps by calculating the specific speed Ns’ speed of different type of pumps 
from Q , H and n without requiring the size of the pump D. 


As seen in Figure 1.49, centrifugal pumps with large pump head H and 
small flow discharge Q have relatively small specific speed Ns' ~ 0.10, 
while axial pumps (small H and large Q) tend to have larger specific 
speed Ns' = 0.5 ~ 1.0. 


Learning summary 

By the end of this section you should have learnt: 

/ non-dimensional numbers are important in understanding the characteristics of the flow as well as 
in comparing the type of flow with others; 

/ Buckingham's theorem gives not only the number of non-dimensional quantities involved, but it 
also determines each non-dimensional quantity; 

/ to carry out model tests, we need to ensure both the geometric and dynamic similarities are satisfied; 

/ one can identify the shape of required pumps by calculating the specific speed without knowing 
the size of the pump. 
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Unit 2 


Thermodynamics 


UNIT OVERVIEW 

■ Introduction 

■ Air conditioning 

■ Gas mixtures 

■ Combustion 

■ Reciprocating compressors 

■ Heat transfer 

■ Heat exchangers 

■ Vapour power cycles 

■ Reciprocating internal combustion engines 


2.1 Introduction 

This unit continues from the foundation in thermodynamics laid out in An Introduction to 
Mechanical Engineering: Part 1. Its aim is to provide an applied emphasis to the concepts already 
learnt. The applications presented here concentrate mainly around power generation from oil, 
gas and coal by combustion to provide heat transfer to produce work via steam power. The first 
section reviews and refines the information that is relevant from the thermodynamics in Part 1, 
and the subsequent sections provide analysis techniques for practical engineering applications. 

The material covered will address the properties of working fluids as in Part 1 ; perfect and 
semi-perfect gases and steam are of interest. Here refrigerants are introduced along with 
combustion reactions and product gas mixtures, with a focus on application. Our 
attention is on what a working fluid can achieve. By manipulating the working fluid, practical 
ends can be met: 

• air conditioning to affect the temperature and humidity of atmospheric air; 

• producing electricity by heating steam with combustion reactions, which drives generators 

via steam turbines; 

• gas and vapour compressors increase pressure at a given flow rate of fluid; 

• combustion gases are used directly to drive engines. 

Transferring or insulating heat is involved in all practical thermodynamic processes, and 
the modes of transfer and the basic techniques for calculation of heat transfer are presented. 
This unit considers how classical thermodynamic machines work and how to calculate the 
relationships between heat and work. 
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The first section considers mixtures of gases, seeing how the volume and mass of each gas 
in a mixture relates to the others. Later sections progress on to the chemistry of combustion 
reactions and then to the energy release of the reactions whether using gas, liquid or solid 
fuels. Moisture is a life-supporting component of our environment and is contained in the 
atmosphere - rain, mist and humidity — and in reactants and products of combustion. Mixtures 
of gases with moisture have specific properties that require special treatment. The unit shows 
how to control atmospheric humidity. 



Figure 2.1 Schematic of Drax power station (Drax Power Limited) 


Other characteristics of the thermodynamic machine require heat transfer, either to produce 
work or to achieve desired temperatures. Situations are considered in which steady heat and 
work transfer occur (‘steady’ meaning processes that do not vary over time). A particularly 
important thermodynamic system is the steam power plant as shown in the schematic in 
Figure 2.1 of Drax power station - the largest in the UK. This unit shows how steam can be 
used to produce power, and how the devices within power stations relate to each other to 
produce useful power output. It relies not only on the internal steam cycle, but also on the 
effective combustion energy release and heat transfer to the steam at the correct rate. 

The law of conservation of energy states that energy can be neither created nor destroyed 
but only transformed from one type to another, and this is the essence of the first law of 
thermodynamics met in Part 1, and henceforward referred to as ‘the first law’. Heat and work 
are forms of energy; they have the same units as energy. The laws of thermodynamics consider 
the relationship between heat and work and the effect that the transfer has on the energy of the 
matter that undergoes the transfer. 

It is important to be able to define the system and the control volume. This means having 
in mind what physical part of a thermodynamic device is under consideration, in order to deal 
with it in isolation. A sketch of the control volume within a process is often useful. Figure 2.2 
shows a crude representation of a thermodynamic system where cool stream of fluid enters a 
tube, is accelerated by a propeller and receives energy in the form of work, proceeds to receive 
energy in the form of heat from a hot source (whether combustion or some other source) 
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Figure 2.2 A control volume in a crude open system thermodynamic device 
having work and heat, real boundaries at physical walls and imaginary 
boundaries within the fluid, marking the extent of the fluid under consideration 


before passing out of the system. A control volume within the system describes what happens 
in a particular part of the system where some thermodynamic action occurs and it is useful 
to separate out this space, breaking down the complex system into manageable parts. The 
real solid boundaries (perhaps moving or flexible but nonetheless impervious) are usually 
relatively easy to define. An imaginary boundary may be used to separate out internal, 
connected parts of the working fluid in process, by careful consideration of the separation 
point required to isolate the region of influence of heat, work and mass transfers. The system 
has to do with the fluid itself rather than the system containing it. The control volume is often 
specified by reference to an artefact, but it is important to note the distinction. 

In Part 1, both closed systems - such as pistons in closed cylinders - and open systems were 
considered although no specific open systems were discussed. The open system is of significant 
importance in this unit, and the first law in the form of the SFEE (steady flow energy 
equation) dominates the analysis. 

The direction of heat and work transfer is important, since maintaining an energy audit is 
vital. It is easy to lose track of an energy budget when the system is large and complex with 
many processes occurring. Even though the subsystems can be separated, e.g. considering just 
one cylinder in a four-cylinder engine, it is important to maintain an energy inventory in 
the same sense. It is logical to consider the transfer of heat and work to the fluid as positive, 
and that is the convention adopted in this unit. This is the European convention. However, for 
argument’s sake, and for completeness, it is useful to consider the alternative convention. The 
American method considers useful work output from the system as positive — a convention that 
has particular merit since it is usually work achieved by a machine that is of most interest. Note 
that the terms above indicate a deliberate relationship to financial accounting — just like money, 
energy has to be accounted for properly. 

Equilibrium is a concept that has been developed on the understanding of kinetic energy 
(KE) and potential energy (PE). If a system is in unstable equilibrium it is likely to lose PE to 
KE. In the broader thermodynamic sense, there are several forms of available PE that can be 
unstable. However, when and only when all are balanced, a state of equilibrium is achieved. For 
example, after complete combustion when no further oxygen is available and the energy state 
of the products is extremely adverse to a reverse reaction, or when the temperature is the same 
throughout, or a vapour is in contact with its liquid at constant volume and temperature. A 
state of equilibrium is one where measurable properties do not change with time. 

In order to change from one state to another, a process must be followed. The idea of a 
sequence of processes in a cycle enables cyclic machines, the cycle implying a repeating 
sequence of events physically, but in the thermodynamics sense it has a very specific meaning. 

A cycle is a sequence of processes that alter the state of the working fluid and then return it 
to its initial state. Cyclic processes are used to produce or make use of external work. Para- 
cyclic machines are very common, in that new material of the same properties is induced from 
the environment at one point in the cycle and ejected to the environment at another point in 
the cycle, to be recycled at some stage. These are open-cycle machines such as jet engines and 
internal combustion engines. 
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Reversibility of processes is an important concept — it is important to 
understand how a reversible process might be possible and what the useful 
output from such a process is. It is only possible in very controlled conditions 
and with very slow processes, so it is practically unachievable, but it implies 
a state of ideal processing of the working fluid with least waste of effort. 

The reason for this will be established when we look at the second law of 
thermodynamics (henceforward referred to as ‘the second law’) in due course. 
On a typical process diagram, say pressure versus volume of a gas undergoing 
processes, we can indicate reversible and irreversible processes (Figure 2.3). This 
indication convention can be important and should be noted - dotted lines show 
irreversible processes, solid lines show reversible processes. Most lines drawn for 
real processes will therefore be dotted, but solid-lined reversible processes are 
often useful for comparison in order to calculate losses due to irreversibility. 

The first law of thermodynamics 



Figure 2.3 Schematic graph of 
pressure versus specific volume, 
illustrating a reversible process with a 
solid line and an irreversible process 
with a dashed line working on a gas 
between two states A and B 


The first law is very important - telling us about the exchange that happens between work 
and heat. In words: 


When any closed system is taken through a cycle, the net work delivered to the surroundings is 
proportional to the net heat taken from the surroundings. 

Equation (2.1) is a statement of the first law, and it represents the conservation of energy principle: 
Q+W=U 2 ~U 1 = m(u 2 - «i) (2.1) 


where 

Q [J] is a quantity of heat transferred in the process 
W [J] is the work done on the system during the process 
U [J] is the internal energy of the working fluid 
m [kg] is the mass of the working fluid in the system 
u [J/kg] is the specific internal energy of the working fluid 


States 1 and 2 are the initial and final states of the system. Internal energy is the amount of 
energy possessed by the molecules of matter in the system in their kinetically energized states. 
The formula may be expressed in differential form as: 

dq + dw = d u (2.2) 

Note that the capital letters indicate bulk quantities, but importantly, lower case indicates 
that only 1 kg of the working fluid is being considered. As written here it concerns the 
conservation of energy in a closed system - i.e. where there is no new mass coming in. It is 
valid for both reversible and irreversible processes because it is just concerned with end states 
and how much heat and work have been transferred to the system. This law is unaffected by 
what the route between the states is. 


The law in words actually talks about the work delivered to the surroundings and the heat 
taken from them during a cycle — note that a cycle means that the state has returned to its 
starting point, i.e. Uj = u 2 .This is due to the route of the work, which can be displayed on a 
state diagram such as the p — v diagram, and there can be a variety of different routes. Similarly, 
there can be a variety of different routes that result in the same work. However, the route is 
usually defined by the engine that is acting on the system. 

The steady flow energy equation is the first law applied to an open system, and can be expressed as: 
Q12 + W l2 = in[h 2 ~ h x + C2 ~ Cl + g(z 2 - z,)) (2.3) 

where 

C [m/s] is the speed of the fluid 
g [m/s 2 ] is gravitational acceleration 
z [m] is height above a fixed datum point 
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Here, heat, work, enthalpy, kinetic energy and potential energy all have same dimension of 
energy the joule, J. A superscript dot implies a rate, i.e.J/s or W and kg/s. Enthalpy is the flow 
process equivalent of internal energy. Since h = u + pv, it consists of the internal energy plus 
the ‘flow work’; the pv part. Mostly in the machines that we consider, changes in velocity (KE) 
and in height (PE) are negligible compared to changes in enthalpy. For example, consider that 
the change in KE accelerating 1 kg of water from 1 to 10 m/s is 50 J and the PE for raising 
the same through 10 m is approximately 100 J; the enthalpy change of water increasing in 
temperature from 10°C to 20°C — a relatively modest temperature change - is 42,000 J! Since 
KE and PE can often be neglected in reality, the SFEE can be reduced to equation (2.4) that 
shows a close resemblance to the first law for the closed system: 

Q12 + W 12 = m(h 2 ~ h x ) ( 2 . 4 ) 


Properties 

There are several properties to he aware 
Extensive 
Volume, V [m 3 ] 

Mass, m [kg] 
Enthalpy, H [J] 
Entropy, S [J/K] 


f, and the following table shows a selection. 
Intensive 
Density, p [m 3 /kg] 

Pressure, p [N/m 2 ] 

Temperature, T [K] 

Specific enthalpy, h [J/kg] 


In the right-hand column are intensive properties. In the left are extensive properties. 
Intensive properties are the same for any quantity of material being considered. Extensive 
properties depend on the amount of material. Intensive properties are mostly expressed per kg 
of material; this is a specific measure of an extensive property. Extensive is indicated by capital 
symbols, intensive by lowercase. In changing from extensive (the whole) to intensive (specific), 
H becomes h, S becomes s, V becomes v; T, and p are inherently intensive and a specific 
measure cannot he stated, h, s, v are specific enthalpy, specific entropy and specific volume. 


Equation of state 

The equation of state relating p, v and T is given by equation (2.5): 

pv = RT, ( 2 . 5 ) 


pV=mRT ( 2 . 6 ) 

where R is the specific gas constant having units J/kgK. Note the alternative use of specific and 
total volume, v and V . These equations hold for all gases. 

Specific heat capacity is defined separately for constant pressure or constant volume processes, 
and is directly related to the first law since specific heat capacity at constant pressure is defined 


which leads to 


= JUlI 

Cp dT \p 


h 2 ~h = Cp(T 2 - T,) 


And specific heat capacity 


which leads to 


constant volume is defined as: 

— dtt I 
Cp ~ dT |„ 


«2 “ «i = c v (T 2 - TO 


( 2 . 7 ) 


(2.8) 


Specific heat capacity is the amount of heat absorbed by 1 kg of gas when its temperature is 
raised by 1 K at either constant pressure or constant volume. A semi-perfect gas describes 
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the behaviour of most gases except the very light ones (helium and hydrogen) — specific 
heat capacity varies with temperature and it is useful to take the average from tables of 
thermodynamic properties when calculating heat transfer to or from a quantity of gas. 

For a perfect gas, the relationship between the constants associated with the gas is important 
and useful. It is important to realize that since the specific gas constant R and the ratio of 
specific heats, y (gamma) are usually known, it is possible to work out the specific heat 
capacities from the quantities above. This is illustrated in the following derivation: 

Since 

h = u + pv, dh = du + d (pv) 
so, 

c p dT = c„dT + RdT 

c p = c v + R (2.9) 

Also 

7 = 7 

Since y and R are usually known or determinable, useful relationships with c p and c„ can be 
formed, e.g. 

y c v = c v + R 


so 


R 

(y - 1) 


(2.10) 


Processes 

Processes describe how a working fluid is changed from one state to another. The following 
list describes processes met in Part 1 : 

Adiabatic = no heat transfer, ptfl = c, q = 0 
Isothermal = constant temperature, pv = c 
Isobaric = constant pressure, p = c 
Isochoric = constant specific volume, v = c 
Polytropic = unspecified process type, pv" = c 
Isentropic = constant entropy, pv 1 = c, q = 0 

Isentropic implies an adiabatic process that is reversible. This is commonly achieved 
approximately in open system devices such as axial flow turbines and compressors. 

The p— v representation can be changed to a relation between p and T or v and T by using the 
equation of state. 

^ , . , . mRT . mRTivf mRT 2 v 2 n . 

For example, m pv n = c, substitute p = — ~ — to give = , i.e. 

= 7> 2 " _1 

Work occurs when the boundary of a system moves against a force — this can be an external 
atmosphere, or a mechanical device. Work can be expressed as: 

W= ~j x pdV (2.11) 

where 

W[ J] — work done on the working fluid 
p [N/m 2 or Pa] — instantaneous pressure of the working fluid 
V [m 3 ] - instantaneous volume of the working fluid 
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Equation (2.11) expresses the work done in a closed system per kg of working fluid, for a 
reversible process, and relies on knowledge of the pressure at every stage. A reversible process 
is ideal, so equation (2.11) tells us the ideal work assuming no friction, no turbulence, nothing 
unrecoverable. Work is negative because of the first law sign convention that we have chosen 
to use, such that if the gas has expanded, the energy has gone from the system in order to work 
against the exterior. It is useful to know how to calculate the ideal work done for a process, to 
compare with the actual work and hence quantify the losses of energy from the system. 

A p-v diagram, illustrated in Figure 2.4(a), is a state diagram, which shows the relationship 
between pressure and specific volume. The limit of an individual process is bounded by 
constant pressure and constant volume process lines (horizontal and vertical respectively) and 
with curves between representing polytropic processes, including isothermal ( n = 1) and 
adiabatic (y = h) processes, n is always positive. Other process lines may be possible if the 
pressure— volume relationship is prescribed by an external mechanical device, such as a spring 
on a piston. The area under the curve represents the work done for a reversible process. In 
irreversible processes, work is ‘lost’ in overcoming friction, in the turbulent fluid and in other 
such resistances. 

p-v diagrams 

Pi 3 


O 2 


Figure 2.4 Schematic graphs of typical state diagrams using pressure and specific volume to 
indicate what is happening to the working fluid: (a) shows the effect of polytropic index on 
ideal processes from 0 (constant pressure process) to oo (constant volume process), (b) shows 
a cyclic process involving four separate processes and (c) shows a real paracyclic process, 
the Otto cycle for petrol engines. 

For a cychc process (Figure 2.4(b)), where the original state is returned to, the area contained 
in the loop of the cycle is the work output (or input) to the system. The sign depends on the 
directional sense of the process path. Considering a clockwise process, net work is produced by 
the system; positive compression of the gas is indicated in the lower curve, and heat is added 
to the gas in order to make it work to expand against the external system in the upper curve. 
The area under the upper curve is greater than the area under the lower one and net work is 
negative (more work is removed from the gas than is put into it, and the working fluid does 
work on the surrounding machinery). If the cycle were anticlockwise then the positive work 
(to compress the gas) is the area under the upper curve; a larger amount of work is put into the 
system than is taken out and the system must lose heat to do this. 

Figure 2.4(c) shows a typical p-v diagram for a four-stroke engine — a polytropic compression 
from 1 to 2 as the crank drives the piston up. Combustion from 2 to 3 as the piston starts to 
he driven down by the expanding combustion gases. A polytropic expansion follows from 3 to 
4, the pressure at 4 being still above the atmospheric in-draft pressure at 1 - the gases still have 
expansion to do if there were enough room to move. The combustion gases are exhausted from 
4 to 5 and a new draught of fresh air is taken in as the crank drives the piston down from 5 to 
l.The work done against the surroundings is the shaded area in the p-v graph. By controlling 
the form of the processes between states in this way, a significant amount of work is produced. 
The upper area is the useful work out; the lower slim area is the pumping work to get the 
exhaust out and the fresh air in and is carried by the other cylinders in the cycle and by the 
flywheel. Note the direction round the cycle is clockwise — net work out. 
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The second law of thermodynamics 

It is important to consider the efficiency of processes. In the p-v diagram for a cycle there is no 
indication about the effectiveness of the relationships between heat and work. Given that work 
cycle, how effective was the conversion of heat to work? How much heat was required to get 
that quantity of work done? The second law addresses this issue; in words it can be stated as: 

It is impossible to construct a system, which will operate in a cycle, extract heat from a 
reservoir, and do an equivalent amount of work on the surroundings. 

From the second law there are eight corollaries, i.e. facts that follow from it and can be proved 
by logical arguments as described in Part l.They can be summarized as follows: 

(1) Heat can’t pass from cold to hot without work. 

(2) Reversibility implies maximum efficiency. 

(3) Temperatures of heat source and sink determine maximum efficiency. 

(4) There is a universal zero value of temperature. 

(5) If you exchange heat with more than two reservoirs the efficiency reduces from the 
maximum possible. 

(6) /dQ/ T for a complete cycle is zero if reversible, and less than zero if irreversible 

(7) fdq/T between two states is the change in entropy, s. 

(8) If q = 0 then 5 increases or remains constant. 

The first corollary is part of everyday experience and is perhaps the best way for an engineer 
to remember the second law. The second corollary is not obvious unless the second law is 
considered, except that we intuitively know that friction causes energy to be lost to disorder. 
The third is very interesting — the extreme temperatures available to a heat engine determine 
the best that it can possibly perform. If it is possible to raise the upper temperature or lower the 
sink temperature then efficiency will increase. Number four is simple to remember, but it was a 
new concept since most temperature scales before the second law depended on the behaviour 
of a material such as water or alcohol. The fifth is reasonably obvious — increasing the number 
of devices that transfer heat in the system introduces more disorder. Number six is surprising, 
and is where we find the difference between reversible and irreversible processes. This sets 
the scene for the most efficient cycle possible — the Carnot cycle, introduced in a subsequent 
section. The seventh is the definition of entropy; this affects the system and the immediate 
surroundings, i.e. the level of disorder in the external reservoirs of hot and cold. The final one 
is a condition for a cycle — if a cycle involves no heat transfer then it is irreversible if entropy 
increases and reversible if entropy does not change. 

Figure 5(a) shows a symbolic representation of a heat engine; it has some undefined 
machinery in the circle, which by virtue of heat exchange with the hot and cold reservoirs, is 
able to develop work on the surroundings. We define cycle efficiency as: 

_ Work done _ ^out _ 1 ~ Qlow 
11 ~~ Heat supplied “ Q hlgh “ Q t „ gh 

(Q} Source of heat I T hot I Q 

Qhighl 

Prime mover f j — 

Heat sink I T cold I 

Figure 2.5 Schematic representation of heat engines; (a) shows a heat 
engine designed to produce work out of the working fluid, and (b) 
shows how the second law allows work to be entirely converted to heat. 
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The most important observation here is that, in order to produce work output, heat has to be 
lost to a cold reservoir. To produce work the engine must exchange heat with both the hot and 
the cold reservoirs. With only one reservoir (Figure 2.5(b)) work can be done on the system, 
but not produced by it. 

Work and heat occur at system boundaries when the working fluid changes its state. Work 
implies that the boundary of the system is moved; heat implies that there is a change of 
temperature of the system. Heat transfer is a less useful form of energy transfer than work 
because all energy transfer by work can be converted to energy transferred as heat without 
having to lose any energy in the exchange, but not all heat transfer can be converted to energy 
transferred by work; in using heat transfer to produce work, some heat supplied must be lost, or 
transferred to a low temperature, and hence low energy content reservoir as part of the process. 


Figure 2.6(a) shows various flow devices that are used in thermodynamic machines to relate 
the working fluid to the outside world. A compressor does work by sucking gas axially into a 
series of disks with aerofoil blades on the periphery. As the gas is compressed it passes through 
ever narrower passages with smaller blades, until it exits at high pressure. These are commonly 
used in the inlet of a turbofan jet engine. The turbine is a compressor in reverse — it takes a 
hot, highly compressed fluid and expands it through ever-widening passages to get work out 
while reducing the pressure and the energy of the working fluid; a turbine is found at the rear 
of a jet engine. A common symbol for a compressor or turbine is the quadrilateral shape at 
top right of Figure 2.6(a) . Pumps are used to deliver a volume of pressurized liquid, in which 
there is relatively little energy increase in the liquid because liquids are virtually incompressible. 
The throttle is a constriction on the flow, the most common example being a tap on a kitchen 
sink. A throttle involves a pressure drop, but is usually at a sufficiendy low flow rate so that the 
kinetic energy change is insignificant. In this case, the change of enthalpy is negligible — there is 
an exchange between u and pv internally. 

Placing these flow processes in a cycle, as in Figure 2.6(b), we see a schematic of a realistic heat 
engine, the vapour power cycle. The hot source of heat and the cold heat sink are external 
to the system - they transfer heat across pipes that contain the working fluid - a liquid being 
pushed through the system by a pump is energized by the heating process and converted to a 
vapour, producing work through a turbine. The fluid is then condensed back to a liquid in the 
cold sink before repeating the cycle. The elements of this machine are a simplified description 
of what occurs in a power-generation station. 


Real processes 


Common turbine 


Practical application 



o 


o 


Figure 2.6 (a) Flow processes that alter the state of a working fluid in an open process; 
(b) a practical implementation of the heat engine. 
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Having a basic heat engine to analyse, we now consider its efficiency. The Carnot efficiency 
is a very important measure. Referring to the heat engine of Figure 2.5, the cycle efficiency 
is the ratio of the work obtained to the heat required. It can be seen from the heat engine 
schematic that work out is the difference between the heat in and heat out. This is determined 
by the temperature of each of the reservoirs: 

_ Work done _ W out _ 1 ~ Qi ow _ 1 ~ T lovl 
"ncarnot Heat supplied Qhigh Qhigh Thigh 

Q [W] — rate of heat supplied to a reservoir of heat energy 
T [K] - temperature of a reservoir of heat energy 

This shows that the higher the temperature difference, the better the efficiency. The most efficient 
machine will exchange heat with a sink at absolute zero. For all real cycles, efficiency is lower 
than the Carnot Efficiency because a small amount of work is required to drive the process (the 
pump in the vapour power cycle) and because heat is added at more than one temperature. 

Entropy is a measure of disorder; the state of agitation of a system; the energy allocated to 
maintaining a state of disorder over the ordered state. This property is defined from the second law: 

^reversible = Tds 

where 

q [[/kg] — specific heat transfer 

T [K] - instantaneous working fluid temperature 

s [J/kgK] - instantaneous specific entropy of the working fluid 

this is the heat transfer equivalent of dw reversl y e = —pdv. It is the minimum heat transfer required 
to get from one state to another. This applies for any reversible process undergone by a closed 
system. 

Isentropic efficiency is used to characterize the performance of many open-system machines. 
Since open-system machines (for example, axial flow turbines and compressors) operate under 
near adiabatic conditions, and a reversible adiabatic process is the most efficient, it is useful to 
compare the actual machine work with the ideal machine work. The isentropic work output 
of a turbine is greater than the energy transferred out of the fluid in practice, and the isentropic 
work input to a compressor is less than the theoretical input power required to drive the 
compressor, i.e. the useful increase in energy to the fluid is less than the isentropic amount. The 
work is measured in terms of change of working fluid enthalpy, and can be determined from 
the steady flow energy equation. 



Figure 2.7 Isentropic efficiency is defined as the ratio of ideal work 
to actual work in an axial flow compressor or turbine. 


The state diagram in Figure 2.8 describes the properties of steam. It is useful to notice the 
characteristics of the chart. The standard notation is to use f to indicate saturated liquid and 
g for saturated gas. If the temperature is 278 K (5°C) and the pressure is 1.013 25 bar (i.e. 
the standard atmospheric pressure at sea level) then the vapour is a sub-cooled liquid. If we 
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State diagram for phase change 
T—v diagram for steam/water 

400 
250 
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Figure 2.8 State diagram for water in terms of pressure and specific volume. The area under the 
curve is the region where vapour and liquid exist together, the left-hand side labelled ‘f’ indicates 
the line separating saturated liquid from the mixture region, and the right-hand side labelled ‘g’ 
indicates saturated vapour, separating superheated vapour from the mixture region. Note that the 
constant pressure lines, which are horizontal in the mixture region, follow approximately the ‘f’ 
line and rise sharply in the superheated steam region. 



reduce the pressure slowly to 0.008 719bar, boiling will occur. This happens on solid surfaces 
within the fluid, and bubbles rise and expand within the fluid. 0.008 719 bar is p s , or saturation 
pressure, for 5°C; similarly 0.023 37 bar is p s for 20°C. Other points to note from this chart 
are that the saturated liquid region is very narrow, i.e. liquid volume doesn’t change a great 
deal with temperature, certainly up to 100°C at atmospheric pressure. The volume difference 
between f and g grows as the temperature falls. Phase change diagrams, regardless of the 
properties compared, all have this approximate form, with division into the three regions. The 
shape of the diagrams is always a dome containing the vapour— liquid mixture with the f-g 
dividing line. The position of the dome and its width vary with properties of the particular 
fluid. 


Just as the area on the p— v graph represents the work done for a reversible 
shows the T—s diagram, in which the area represents the heat transfer for 
The heat transfer for the reversible case is useful because it is the 
limiting heat transfer, to which actual heat transfer can be compared. 
Although it has no meaning for irreversible processes, it shows the ideal 
for a particular type of process. It is particularly useful for liquid— gas 
materials such as water and refrigerants. 

The T—s diagram for steam in Figure 2.9 has constant pressure lines 
drawn on it. The pressure lines are horizontal through the central 
vapour-gas mixture region under the dome, and then follow the 
contour of the saturated liquid side of the dome (the left-hand side 
from the critical point at the apex). Entropy for water is assigned 
a value of zero at zero Celsius for convenience. By the third law 
of thermodynamics the absolute value of entropy of any substance 
is zero at zero K. Despite this, use of an alternative reference state 
is helpful. This diagram is particularly useful for the forthcoming 
vapour power cycle section of this unit and for refrigeration. This 
is very useful regarding the Carnot efficiency, against which the 
efficiency of all plant is compared. Turbines are considered as constant 
entropy devices. 


process, Figure 2.9 
reversible process. 


State diagrai 
-s diagrai 


l for phase change 
i for steam/water 



Figure 2.9 State diagram for water in terms 
of temperature and specific entropy. The 
characteristics of this chart are similar to 
the p-v chart, except that the dome is more 
spread out on the horizontal axis. 
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Learning summary 


By the end of this section you will have learnt: 

/ there are principal descriptive terms used in the previous volume that are now to be used with an 
emphasis placed on how they find application in thermodynamic processes. In particular, the ideas 
of cycles, processes and irreversibility are useful for considering practical thermodynamic systems; 

/ the first law of thermodynamics is directly represented by the steady flow energy equation in most 
engineering processes because the fluids move through the system. The effects of potential and 
kinetic energy terms are considered to be negligible in many practical thermodynamic processes. 
The equation then reduces to a form that considers the interchange between the enthalpy of the 
fluid and the energy transferred to it in the forms of heat and work. This is the most important and 
straightforward relationship to remember for applied thermodynamics - it is the cornerstone of 
applied thermodynamic machines-, 

/ the properties of thermodynamic working fluids are important in determining thermodynamic state, 
and the equation of state for perfect and semiperfect gases is used in most calculations involving 
machines where a gas is the working fluid; 

/ thermodynamic processes change the state of a working fluid, and cyclic processes and heat and 
work transfers in a series of processes are commonly used to produce heat and work; 

/ the work done on the fluid in a closed process is W= -j^pdV for a reversible process; 

/ state diagrams and the representation of processes on them indicate what is going on in heat 


/ the second law of thermodynamics describes how heat and work can and cannot be converted in 
a thermodynamic machine; 

/ the eight corollaries of the second law are principles of real thermodynamic processes and 
demonstrate pragmatic observations relating to the second law; 

/ cycle efficiency describes how well a machine takes ingoing energy and converts it to useful 
output; 

/ the practical components of real thermodynamic machines are turbines for work out, pumps and 
compressors for work in, and throttles for reducing pressure; 

/ the Carnot efficiency is the measure of the maximum achievable performance of a heat engine; 

/ isentropic efficiency is a comparison of the real process power output to the ideal, reversible 
process power output. 


2.2 Air conditioning 

Air conditioning is mainly concerned with controlling temperature and humidity. It is 
important for controlling the comfort of living organisms and for keeping complex electronic 
circuitry within suitable climatic conditions. 

Comfort air conditioning for people: 

— people have a limited comfort zone due to the requirement to maintain a steady core 
body temperature of 37°C; 

— people produce heat and release moisture into the atmosphere; 

— people generate heat at ~80W resting, 120W office work, and up to 400W during 
physical work; 

— people produce sweat at varying rates, and 100% humid air during respiration. 

Control conditioning for computers: 

— computers have a limited ‘comfort’ zone requirement for steady core temperature and dry 
conditions, which are cooler and dryer than for people. 
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A so-called ‘swamp cooler’ is a good example of air conditioning - blowing air through a wet 
cloth in hot dry climates causes the air to emerge on the other side sensibly colder - typically 
in British indoor conditions, this will cause a 3 or 4°C drop in the air temperature. This is akin 
to the common experience of climbing out of a swimming pool and feeling cold in air that is 
actually warmer than the water. The temperature difference is due to evaporation — otherwise 
known as evaporative cooling. This difference increases with increasing temperature and 
decreasing humidity — a term of primary importance in air conditioning. 

Figure 2.10 shows an experimental apparatus for air conditioning tests. A duct of approximately 
2 m length has an in-draught fan, a 5 kW steam boiler for moisture addition to the air stream, a 
2 kW air heater, a cooler unit (refrigeration circuit) capable of approximately 3 kW of cooling, 
another heater of 1 kW and an orifice plate to measure the air flow rate (section 3.3 of Part i). 
All of these components affect the condition of the air. 



Figure 2.10 A basic air conditioning unit - fan and refrigerator circuit 


When it is very dilute, water vapour is considered to be a perfect gas. 

Figure 2.11 shows the principle of mixed gases in which gas molecules 
of different gases have different colours to illustrate that they are 
intimately mixed and are undergoing molecular motion and collisions 
with a level of kinetic energy (dependent on temperature) that gives 
internal energy to the system and results in an overall pressure. These 
gases can be imagined separately filling the entire space. They would 
then give a pressure in the space on their own. The law of partial 
pressures observes that when the gases are mixed, their individual, 
separate pressures add up to make the total pressure of the combined gas 
mixture. The pressure ratio is directly proportional to the ratio of the 
number of moles as described in equation (2.12). 

Gibbs— Dalton law (law of partial pressures) 

p = J2 p ' 

where 

p [Pa] mixture pressure 

Pi [Pa] — the partial pressure due to component i 

We will see later that the partial pressure is related to the number of moles or mols (n) of gas in 
the mixture: 


where 

n [mols] - number of moles of a species (subscript i) or of the mixture (no subscript). 



Figure 2.11 Perfect gases have partial 
pressures - the pressure of that amount of 
gas if it alone occupied the entire volume. 
Here molecules of three gases are inter- 
mixed. 


( 2 . 12 ) 
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Note that the number of moles indicates the quantity of a substance. Every molecular substance 
has a molecular mass; that is the mass of a specific number of molecules of the substance. The 
specific number of molecules in a mole is the Avogadro number - 6.23 X 10 23 molecules 
per mole. Since molecules generally have differing sizes, they have different molecular masses. 
The molecular mass is defined in the units g/mol or kg/kmol. 

Air is classified either with or without water vapour as atmospheric air or dry air 
respectively. The properties of dry air are available in tables, and are just that — air without 
any moisture whatsoever. Dry air is a mixture of gases, which can be regarded as existing in a 
universally constant ratio across the globe at sea level, mainly nitrogen and oxygen. The partial 
pressures are determined by the number of moles of each gas present in a given volume, which 
is in the same ratio as the volumetric ratio. Water vapour can be supported in the air by virtue 
of its partial pressure — or its saturation pressure at the temperature of the atmospheric air. 

Worked example 

Atmospheric air consists of dry air and water vapour. 

Dry air, composed of N 2 and 0 2 , by mass the proportions are approximately 
76.7% nitrogen and 23.3% oxygen, and by volume, 79% nitrogen and 21% oxygen. 
What are the partial pressures, assuming atmospheric pressure is one standard 
atmosphere, 1.01325 bar? 

nitrogen - 0.79 X 1.01325 = 0.80047bar 
oxygen -0.21 X 1.01325 = 0.21278 bar 


The practice of boiling water is familiar, and 
it occurs when the saturation pressure of 
a fluid is equal to the atmospheric pressure. 

Water boils at 100°C (the boiling point) at a 
pressure of 1 atmosphere (1.01325 bar). This 
is the saturation temperature, T s , and the 
corresponding saturation pressure, usually 
denoted p r The chart of saturation pressure vs. 
saturation temperature in Figure 2.12 shows 
that p s drops sharply when the temperature falls 
below 100°C. 

Saturation pressure shows the pressure that water vapour exerts on a free water surface because 
of the energetic molecules it consists of. When the saturation pressure is equal to the local 
atmospheric pressure, boiling occurs; it can occur anywhere on a solid surface submerged in the 
liquid, the surface providing nucleation sites for vapour bubble growth. Initially, small bubbles 
grow as they rise through the liquid before bursting through the free surface. 

Apart from boiling, liquid can evaporate from a free surface and condense at a free surface. 

In equilibrium, evaporation and condensation occur at equal rates. The partial pressure of 
the vapour in contact with its liquid will be the saturation pressure corresponding to the 
temperature of the atmosphere. When water vapour is not in contact with a free liquid 
water surface, the partial pressure will be less than or equal to the saturation pressure at that 
temperature. The level of humidity will be determined by the saturation pressure of the liquid 
at the prevailing temperature. 

There are two measures of humidity, which are useful in different ways. Absolute humidity, 
co, describes the mass of water vapour contained by a given mass of air. 

m s (vapour) m s /V v a 
° J= «.(air) = ^7V = V S 
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Figure 2.12 Water vapour and its saturation 
temperature and pressure 
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Note here that mj V = l/v s and mj V = \/v a , and hence the relationship in terms of specific 
volume also. 


The following derivation shows how this formula can be converted from mass fractions ti 
partial pressures: 

Ps 


and 

Therefore, 


Pa V= m a R a T => T7 = Yt 


RaPs 

' RsPa 


Since R a is 287J/kgK and R s is 461 J/kgK and P atmos = p a + p s , this equation becomes: 
287 p s _ 0.622 Ps 

“ _ 461 (p atmos - p s ) ~ (p atm os 2 p s ) 


This relationship is very useful in the analysis of air conditioning systems, as will become clear 
in the subsequent parts of this section. 


The second measure is relative humidity, cp, which gives a measure of how far from 
saturation the atmospheric air is, and is defined by equation (2.14): 

c|> = — (2.14) 

Ps 

Note carefully the redefinition of p s for this equation only — here p s is partial pressure of water 
vapour actually in the air, not the saturation pressure, and p g is the partial pressure of vapour if 
the mixture is saturated at the temperature, T, of the mixture, i.e. p g is the saturation pressure. 
Note that this use of p s is only to be used for air conditioning and not to be confused with the 
saturation pressure also denoted p s in the tables. Here, and only for air conditioning, p g means 
saturation pressure. This is the standard notation for air-conditioning engineers, which is 
unfortunately at odds with the standard notation used elsewhere in thermodynamics. 


A typical range of co is from zero up to about 30 g H 2 0 per kg of dry air at normal earth sea 
level conditions and cp varies between 0 and 100%. 


If the temperature of the air falls until the saturation point or 100% relative humidity occurs, 
the air is at the dew point temperature; this is the temperature at which air becomes saturated 
when cooled at constant pressure (since co = constant, P s is constant during cooling). This can 
be verified by reference to Figure 2.8 or Figure 2.9. 


Examples 

(1) A sample of atmospheric air contains 12 g of water in 1.2 kg of dry air, what is 
the absolute humidity? 

In this case there is 12 g per 1.2 kg, which is a specific or absolute humidity, co, 0.01. This is 
typical of the standard atmospheric condition. 

(2) What is the specific volumetric ratio of moisture to air? 

The inverse of co is the volumetric ratio — i.e. partial volume ratio. The specific volume 
ratio of air to vapour follows, i.e. 0.01 m 3 /kg of air per m 3 /kg of vapour. This shows the 
significant difference in specific volume of the water vapour and dry air. 

(3) If dry air specific volume is 1 m 3 /kg, and partial pressure of water vapour is 
0.016 bar, what is the specific volume ratio and, hence, what is the specific 
humidity? 

Air specific volume, V is 1 m 3 /kg and specific volume of vapour is (from Figure 2.8 or, 
more accurately, from thermodynamic property tables) 83 m 3 /kg, specific volume ratio is 
gj = 0.012, which is the specific humidity. 
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(4) Given a specific humidity of 0.02, and atmospheric pressure of 1.01325 bar, what 
is the partial pressure of water vapour? 

w = 0.02, and p atmos = 1.01325 bar ,p s = therefore, p s = 0.03156bar 

(5) The humidity is 50% in atmospheric air at 20°C; what is the partial pressure of 
the water vapour in the air? 

At 20° C, p x is 0.02337 bar. Given the relative humidity is 50%, the vapour pressure, p s 
must be 0.02337 X 0.5 = 0.01 17 bar. 

(6) If the temperature of the atmospheric air is 35°C and the relative humidity is 
50%, what is the dew point? 

Find p s from the relative humidity, cp, formula with p„ at 35°C.Then identify the temperature 
that corresponds to this, that is the dew point. p g = 0.056 bar (from thermodynamic property 
tables). p s = 0.5 X 0.056 = 0.028 bar. From the tables the temperature that corresponds to this 
as its p g is 23°C.This means that when the temperature falls to 23°C, moisture will condense 
out of the air, either as a cloud if the bulk air temperature decreases, or more usually on sky- 
facing surfaces which cool down due to losing heat to the night sky by thermal radiation. 

Hygrometry or psychrometry 

The analysis of air condition is called hygrometry and the two measures just 
introduced provide the basis for the work of air-conditioning engineers. In order 
to control air condition, we must be able to measure it easily. The basic tools 
of the trade are wet and dry bulb thermometers as shown in Figure 2.13 
The wet bulb differs from the dry one only in that it has a muslin sock around 
the bulb that is soaked in water from a reservoir. If the local atmosphere has 
humidity less than 100% then water will evaporate from the sock and cause the 
temperature of the bulb to drop, owing to the latent heat of evaporation, and 
therefore the thermometer will register a lower temperature. This temperature 
enables us to calculate the relative humidity. 

A spreadsheet can be calculated based on this theory to determine the relative 
and specific humidity from the wet and dry bulb temperatures. The chart in 
Figure 2.14 is called a psychrometric chart and it shows the condition of the 
air for a given pair of wet and dry bulb temperatures. Data represented includes 
relative and specific humidity, enthalpy of the mixture and specific volume of the 

Examples of reading the psychrometric chart 

(1) What is the relative humidity if T DB is 30°C and T wn is 20°C? 

The dry bulb temperature is read from the bottom axis, and constant temperature lines 
rise vertically. The wet bulb temperature is read from the 100% relative humidity curve on 
the left-hand side. Constant wet bulb temperature fines are inclined to the horizontal. The 
intersection of the two lines defines the air condition, and the relative humidity is indicated 
on the curves at 10% intervals. The approximate value for this particular condition is 40%. 

(2) What is the specific humidity if it is 100% humid at 30°C? What does this tell you about 
the relative masses of air and water at 30°C? 

Taking the position on the 100% relative humidity curve, and marking the point when the 
dry bulb temperature is 30°C defines the air condition. Now read horizontally across to 
the vertical axis on the right-hand side to show that the absolute humidity is 0.025. 

(3) What is the enthalpy for a 50% relative humidity at 20°C dry bulb? 

The intersection of the 50% relative humidity curve with the vertical 20°C dry bulb 
temperature defines the air condition. The enthalpy is found by placing a rule on the air 
condition point, and rotating about that point until the enthalpy on the upper enthalpy 
scale and the enthalpy on the lower scale are the same. In this case it is 39kJ/kg. 
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Figure 2.14 The psychrometric chart. Reproduced by permission of the Chartered Institution of Building 
Services Engineers. 
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Figure 2.15 shows the basic principles of 
operation of an air-conditioning unit. Air 
to be conditioned is drawn in by a fan, 
cooled down either to simply reduce the 
temperature or, more commonly, to also 
decrease humidity by reaching dew point 
and causing condensation. Air is heated up 
after the cooler when condensation has 
occurred in order to make the temperature 
comfortable before it is issued to the room. 

The power of the heating and cooling 

is determined from the desired changes in enthalpy of the dry air/ water vapour mixture 
between each of the sections. This must be equal to the energy required to be drawn from the 
external source. The steady flow energy equation is used, applying conservation of mass and 
energy. With reference to Figure 2.15: 

Mass conservation: 


Figure 2.15 Enthalpy exchanges in the cooling section 


At stage 3, there is the same mass of dry air, but some moisture has fallen out of the cooling 
unit (refrigerator coils) to the drain. So we have the same mass of dry air with different 
enthalpy (calculable by dry air temperature and specific heat capacity at constant pressure, 
c p ), and we have a (smaller) amount of water vapour with the air. We also have an amount of 
condensate — the condensed water vapour on the cooling coils. 

Then m a h a \ is ingoing air enthalpy, m s2 is from specific humidity and h s2 from enthalpy of water 
vapour at t 2 ; m w4 is the condensate at the temperature of the cooling coils with h u4 from tables 
as hjirt- 



An air-conditioning unit has a fan that delivers a mass of atmospheric air. 

The mass flow rate of the dry air in this is m a , which is 0.5kg/s. The specific 
humidity is 0.01 kg/kg air, and it can be concluded that the mass flow rate of 
the water vapour, m s2 , is 0.005 kg/s. The ingoing air temperature is 30°C. 

The unit causes a drop in specific humidity to 0.006 kg/kg. What is the rate of 
heat taken out of the air in the cooler section of the air conditioning unit, q? 
What is the pressure of the dry air, p a , and of the water vapour, p s , at exit? 

(1) Calculate p s2 and p s3 using specific humidity-vapour pressure formula. 

(2) Calculate the mass flow rate of dry air and of water vapour at entry to, 
and at the exit from, the cooler. 
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(3) What is the dew point temperature corresponding to p g = p s3 i 

(4) Find c pa ( r from the tables at the appropriate mean temperature. 

(5) Find, from the tables, the specific enthalpy of liquid water at the dew point 
temperature, and of the vapour at points 2 and 3. 

(6) Use the SFEE across the cooler to find Q the rate of heat delivery to the 
air stream. 

0.622 p s 0.622/r. 

using o> = — — = / jQ132 g- - - ^ 0.01 =» p s2 =» 0.01 X 1.01325 - 0.01 p s2 = 0.622 'p a 
0.01013 = 0.632 ft2 => p s2 = 0.016 bar 
p s3 must be 100% 4», .'. p g = p SA t = fa 

0.622 p s3 0. 622p s 3 

to = =>- 0.006 = . ». =>• 0.00608 — 0.006p S 3 = 0.622p s3 

p - p si 1.01325 - p s3 Fsi rsi 

=> 0.00608 = 0.628/j S 3 = 0.0097 bar 

Atmospheric air mass flow rate at entry is given as 0.5 kg/s. We know 
co => 2 = 0-01 and m DRYAIR + m w = 0.5 kg/s .’. m w — 0.005 kg/s 

m DRYAIR 

and 

^dryair = 0.495 kg/s. 

At exit, we must have Wdryair. = 0.495 kg/s still - it can only go out one way. 

We know that to = 0.006, and therefore we say 

Ww 

to = q = 0.006 m w = 0.003 kg/s 

at p s3 , t SAT ~ 6.5°C from tables of saturated steam and water. 

The mean temperature of 6.5°C and 30°C is 18.25°C or 291 K. Using the data for 
dry air at low pressure, 


T 

[K] 

C P 

[KJ/kgK] 

C v 

[KJ/kgK] 

7 

[~] 

p, X 10~ 5 
[kg/m.s.] 

KX 10~ 5 
[kW/m.K] 

Pr 

[~] 

175 

1 .0023 

0.7152 

1.401 

1.182 

1.593 

0.74 

200 

1 .0025 

0.7154 

1.401 

1.329 

1.809 

0.73 

225 

1 .0027 

0.7156 

1.401 

1.467 

2.020 

0.72 

250 

1 .0031 

0.7160 

1.401 

1.599 

2.227 

0.72 

275 

1 .0038 

0.7167 

1.401 

1.725 

2.428 

0.71 

300 

1 .0049 

0.7178 

1.400 

1.846 

2.624 

0.70 

325 

1 .0063 

0.7192 

1.400 

1.962 

2.816 

0.70 


Table 2.1 Dry air at low pressure (Rogers and Mayhew, 1995) 


c p,air)29i k could be interpolated for accuracy, but inspection shows it is 1.0045 kJ/kgK. 
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(5) The saturated water and steam table is required again. 


T 

[°CJ 

Ps 

[bar] 

Vg 

[m 3 /kg] 

h f 

[kJ/kg] 

bfg 

[kj/kg] 

h 9 

[kJ/kg] 

0.01 

0.006112 

206.1 

0 

2500.8 

2500.8 

1 

0.006566 

192.6 

4.2 

2498.3 

2502.5 

2 

0.007054 

179.9 

8.4 

2495.9 

2504.3 

3 

0.007575 

168.2 

12.6 

2493.6 

2506.2 

4 

0.008129 

157.3 

16.8 

2491 .3 

2508.1 

5 

0.008719 

147.1 

21.0 

2488.9 

2509.9 

6 

0.009346 

137.8 

25.2 

2486.6 

2511.8 

7 

0.01001 

129.1 

29.4 

2484.3 

2513.7 

8 

0.01072 

121.0 

33.6 

2481 .9 

2515.5 

9 

0.01147 

113.4 

37.8 

2479.6 

2517.4 


Table 2.2 Saturated water and steam (Rogers and Mayhew, 1 995) 

From this it can be seen that hj- of liquid water at 6.5°C is about 27 kj/kg, h„ at 6.5°C 
is 2512 kj/kg and at 30°C, h g is 2555 kj/kg. 


0.04004 

0.04242 


34.77 

32.93 


121.5 

125.7 


2432.4 

2430.0 


2553.9 

2555.7 


Table 2.3 

(6) Now use the SFEE, which is the statement of the first law (conservation of energy) 
for moving fluids, and ignore the terms from kinetic and potential energy as being 
negligible, Q + W = A H. In the air-conditioning unit between 2 and 3, there is 
no fan or other working device, so W= 0. We need the change in enthalpy of the 
fluids in section 2 to 3 in order to work out the heat transfer to cause it. 

Referring to the schematic, the enthalpy flows coming in at 2 and out at 3 and 
through the condensate collection chute, are all indicated in terms of mass flow rate 
and specific enthalpy. The flow of enthalpy can then be compared in the SFEE: 


" DRYAIR"DRYAIR,2 m DRYAIR"DRYAIR,3 


+ m w , 2 h„ 2 


COND"/>COND 
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We can use A H = mCpAT for the enthalpy change of dry air, which is 
0.495 X 1.004 X (30-6.5) = 11.68kW 

just to cool the dry air. The change in enthalpy of the water vapour carried in the air is 
mainly due to the loss of mass as vapour that condenses. Therefore, with the data for h g 
and mass flow rate of vapour going in at 2 and out at 3, we have 

0.006 X 2555-0.003 X 2485 = 5.32 kW 

The condensate is assumed to leave at 6.5°C, and it has a mass rate of 0.003 kg/s, 
therefore enthalpy flow rate is 

0.003 X 27 = 1.27 kW 
Putting all this in the SFEE, we have 

Q= 11.68 + 5.32- 1.27 = 15.73 kW 

Refrigeration and heat pumps 

In considering a practical air-conditioning plant, the cooling duty must be supplied by a 
refrigeration unit, and the application of refrigeration and heat pumps is therefore considered 
here. Figure 2.18 shows a heat pump in which work is converted into heat to produce a flow of 
heat from cold to hot. It is a heat engine operating in reverse. The process is indicated by arrows, 
essentially this is equivalent to running a reverse of a vapour power cycle with the reverse a 
condensing hot reservoir, an evaporating cold reservoir - a compressor and a throttle. 



Figure 2.18 Refrigeration or heat pump circuit: (a) schematic of the 
reversed heat engine, (b) schematic of the physical circuit required 

Superheated refrigerant vapour is pumped round the circuit by the reciprocating compressor. 
The compressor maintains the evaporator at a low pressure by drawing evaporated vapour 
out of it; since the vapour pressure is low and the saturation temperature is correspondingly 
low - lower than the surrounding cold region — the liquid refrigerant in the evaporator tubes 
evaporates as heat is absorbed from the relatively hot surroundings at the cool temperature. 

The compressor drives the evaporated vapour through to the higher pressure condenser tubes 
at the top. The high pressure means high saturation pressure and therefore high saturation 
temperature, and the vapour is forced to condense since the saturation temperature is higher 
than the hot reservoir temperature. This is a refrigerator too, if the cool side is the useful output. 
Note the symbol for the throttle on the left, which reduces pressure at constant enthalpy. 
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The pressure, p, vs specific enthalpy, h, properties chart is 
the easiest chart to deal with for refrigeration cycles since 
it yields the enthalpy for a process directly The enthalpy is 
the energy content of the working fluid, and hence energy 
balances can very easily be performed with operations 
occurring at two pressure levels. The p-h diagram for the 
commonly used refrigerant R134a is displayed in simplified 
form in Figure 2.19 Note the examples of constant 
temperature lines drawn on in blue (10°C) and pink (40°C). 
On the superheated side of the chart (the right-hand side of 
the domed line), the temperature lines descend in a curve; 
on the subcooled liquid side the lines rise vertically because 
the liquid is incompressible over the range of pressures 
represented. A typical heat pump process is plotted in 
Figure 2.19, showing a cycle that runs in the anti-clockwise 
direction. We know from the earlier description of cycles 
that this direction means that net work is being put into the 
cycle. 



Figure 2.19 p—h diagram of a typical modern refrigerant 


The chart is plotted as follows: 

• horizontal high pressure and low pressure lines across; 

• find the temperature of the evaporator exit at the pressure in the evaporator, on the 
superheated side of the chart; 

• find the temperature of the compressor outlet, which is in the superheated region, at the 
high pressure of the condenser; 

• draw a line showing the process of the compressor between evaporator exit and compressor 


• find the temperature of the outlet from the condenser, which is in the subcooled liquid 
region; 

• a vertical line through the saturated liquid point corresponding to the condenser exit 
temperature defines the process in the throttle, assuming that it is a perfectly constant 
enthalpy device; 


• the change in enthalpy across the three main components 
gives the heat/work transferred in that component. 

The reason for the throttle having no change in enthalpy 
may not be immediately obvious, but in practice the reason 
is basically as follows: we know that the throttle is short and 
should be well insulated, and so Q is nearly zero. There is 
obviously no work in the throttle — no paddles or pistons to 
move the fluid. Use the SFEE Q + W = mAh, where Q and 
W are rates of heating and power input and m is the mass 
flow rate. It is easy to see that if Q and W are zero, then the 
enthalpy entering is the same as that leaving. 

The heat pump circuit relies on vapour- liquid behaviour. 
Vapour enters the condenser tube (Figure 2.20(a)) at a 
temperature above T high and exchanges heat, Q out of the 
condenser to cool the vapour. The vapour condenses when 
it reaches T sat until all is condensed. T sat needs to be a few 
degrees above T hot for it to work properly. The condensed and 
slightly subcooled liquid passes through the throttle (Figure 
2.20(b)), in which the SFEE causes no change in enthalpy 
as noted above. This reduces the pressure of the fluid and 
therefore the T sat reduces to a point just below the 
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Figure 2.20 How the refrigeration circuit works 
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temperature of the cool reservoir (which contains the evaporator). As it enters the evaporator, 
liquid heats up to T sat and starts to boil (Figure 2.20(c)). By the time it reaches the end of the 
evaporator, all is evaporated and the vapour gains a little superheat. 

The heat pump is used not only for refrigeration: the hot reservoir can be the useful heat 
output of the unit. This is the case in ground source heat pumps used for domestic heating. 
This is an effective way of providing heating because the coefficient of performance ensures 
that a significantly higher amount of heat is transferred than the energy supplied to the 
compressor. 

Coefficient of performance 

The heat pump effectiveness is measured in terms of how much heat is transferred either out 
of the cold reservoir or into the hot reservoir, depending on whether it is being used as a 
refrigeration unit or a heat pump. In either case, the cost of pumping the heat is the power into 
the compressor. A measure of how well the heat pump works is therefore the ratio of the heat 
transferred to the work input, and these are given by the enthalpy changes in the evaporator and 
condenser for the heat transfers and by the enthalpy change in the compressor. From Figure 2.20, 
the change in enthalpy is given on the horizontal axis in kj/kg of refrigerant. The difference 
is obviously far greater in the evaporator and condenser than in the compressor. For the cycle 
shown these enthalpy changes are approximately 200 kj/kg, 250 kj/kg and 50 kj/kg respectively. 

200 250 

The ratio is therefore for refrigeration = 4 and for the condenser yyy = 5. These are far 
greater than unity and therefore cannot he stated as an efficiency. This ratio is known as the 
coefficient of performance and is the standard way of quantifying the effectiveness of heat 
pumps. 

Learning summary ^ 

By the end of this section you will have learnt: 

/ atmospheric air is a mixture of dry air and water vapour; 

/ air condition defines the temperature and humidity of atmospheric air; 

/ the law of partial pressures and partial internal energy determines what proportion of a gas 
pressure or internal energy is due to each individual gas component; 

/ there are two measures of humidity: specific (or absolute) humidity - the mass of water vapour per 
unit mass of dry air; and relative humidity - the ratio of the partial pressure of water vapour in the 
air, p s , to the maximum partial pressure of the water vapour at that temperature, p g , which is the 
water saturation pressure; 

/ gases may be quantified in moles, which is a specific number of molecules (the Avogadro number, 
6.023X1 0 26 per kmol); 

/ dew point is the temperature at which atmospheric air becomes 1 00% saturated with water 
vapour (i.e. p s = p g ), such that if the temperature is further reduced, water condenses out of the 
atmosphere; 

/ the psychrometric chart is the key tool of air condition monitoring together with the wet and dry 
bulb thermometers, and the terms psychrometry and hygrometry are introduced to describe the 
study of atmospheric air; 

/ the principles of operation of an air-conditioning unit require enthalpy balances to determine the 
heat power required by heating and cooling in the unit to produce a particular condition; 

/ the heat pump is the generic refrigeration unit that is required to produce cooling in the air 
conditioning unit, and that can also be used to provide heating energy from a cold source; 

/ th ep-h diagram is used to plot heat pump processes, since it quickly yields enthalpy changes 
between points, which represent the heat and work transfers required to produce the changes of 
state; 

/ the coefficient of performance is introduced as the measure of heat pump effectiveness. 
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2.3 Gas mixtures 

There are two ways of analysing gas mixtures — by mass fractions and by volume fractions. The 
law of partial pressures and internal energy leads to analysis by mass — gravimetric analysis. 
Using it enables calculation of properties of a mixture, i.e. h, c v , c„, R,M, s from the individual 
properties combined according to their mass fractions. 


Gravimetric analysis 

The proportions by mass are useful for determining the fluid properties of the mixture. The 
Gibbs— Dalton law of partial pressures presented in Section 2.2 gives the method for doing this 
as follows: 

Deductions from the Gibbs— Dalton law of partial pressures: 

Enthalpy 

H = U + pV 

H = 2.Ui+ 2p,-F 

H = 2H,- (2.15) 

Here, the subscript i indicates a particular species of gas in a mixture of gases, and it is 
comprised as shown here by use of the Gibbs-Dalton Law, of a contribution from each of the 
constituent gas enthalpies. 

Specific heat at constant volume, c v 

From the summation (conservation) of internal energy 
m(u 2 - «i) = 2m,- (« 2i - M l{ ) 
mc v (T 2 -T 1 ) = Xm i c*(T 2 -T 1 ) 

c v = ^c vi (2.16) 

Specific heat at constant pressure, c p 

From the summation (conservation) of enthalpy 

m(h 2 - = 2m,- (h 2i - h ij 

mc p (T 2 — Ti) = 2 m,- c pi (T 2 — T{) 

*= 2^9 (2.17) 

Gas constant R for the mixture 

From Joule’s law 

R = c p — c v 
R = 2^p - 

R = 2 ^ (cpi ~ cj 

R = ^~Ri 


( 2 . 18 ) 
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Apparent molar mass m 



(2.19) 


Entropy 

This can be calculated by treating the mixture as a perfect gas. 

From the first law, dq = -d w + du 

Substituting from the second law and reversible work in a closed system: 


Hence: 


Tds = pdv + du 


Now, du = c v dt and p/T = R/v 

, n dv dt 
ds = R 1 


■W?) 


(2.20) 

( 2 . 21 ) 


This can he converted to alternative forms, i.e. p—T and p-v forms, by using the equation of 


Since pv = mRT, to get the p-T form: v%!v\ = (pi/p^(T 2 /Tx) 

Substitute into the previous entropy formula to give: 

s 2 - », = Rln(^) + + 1 , 1 n(^) 

%_si “ Rin 0 +(R+£jin (f) 

( 2 . 22 ) 

Therefore for mixtures 

m(s 2 — Si) = ~ s^) 

H Mf)- R1 ”(s;)] 

~ h) = 2«»iMv, T _ i) v ,t (2-23) 

Note: Even for a perfect gas mixture, the entropy depends on state rather than temperature 
alone (see h and u, which are just temperature dependent for a perfect gas). 
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Worked example 

Calculate c p for the following fuel and air mixture at 300 K: 
methane 0.1 kg, ethane 0.25 kg, nitrogen 0.75 kg and oxygen 0.23 kg 


Using the tables of thermodynamic properties, the c„ for each of the gases can be read 
at 300 K as methane, 2.226 kJ/kgK, ethane 1.766 kJ/kgK, nitrogen 1.040 kJ/kgK and 
oxygen 0.918 kJ/kgK. Using the formula from above the c p of the mixture is 
(0,1 X 2,226 + 0,25 X 1,766 + 0,75 X 1,040 + 0,23 X 0,918) 


1.33 


- = 1.245 kJ/kgK 


Volumetric and molar analysis 

Volumetric analysis is directly related to molar analysis. So if we have the proportions 
of volumes of the gases reacting, we know the proportions of moles reacting. The analysis 
examined so far is gravimetric, that is the analysis of gas mixtures by the separation of the 
constituents and their estimation by mass. However, in some cases the composition based on 
volumes (or volumetric composition) of gases in the mixture is known. 

Example: The volumetric composition of air is approximately 79% nitrogen, N 2 , 21% oxygen, 
0 2 , by volume. That is in 1 m 3 of air, the partial volumes of nitrogen, N 2 and oxygen, 0 2 are 
0.79 m 3 and 0.21m 3 . 


Law of partial volumes: Amagat’s law of partial volumes 

The volume of a mixture of gases is equal to the sum of the volumes of the individual constituents when 
each exists alone at the pressure and temperature of the mixture 

V, 

V = mixture volume at p, T. 

V { = partial volume of gas component i at p, T. 

V hP, T refers to the volume occupied by gas T at the mixture pressure, p, and mixture 
temperature, T. 

It is important to understand the subscripts that identify the various components. 

Formula for the relation between partial pressure and partial volume 

Partial pressures and partial volumes are alternative ways of describing the mixture. Either a 
component occupies p, at V or V { at P but not V { at p r Using the gas law, considering a perfect 
gas: 

By partial pressure, p, V = mffT 
By partial volume, p Vj = mffT 
Vi Pi 

Divide to give: -y = — (2.24) 

Equation (2.24) shows that proportions by volume are equal to proportions by pressure. So 
if a gas occupies 10% of volume of a mixture, its partial pressure will be 10% of the mixture 
pressure. 
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Molar analysis 

By definition, this is the same as the volumetric analysis but using mole fractions. The mole was 
presented as a measure of substances in the previous section on air conditioning. 

Definitions: 

(1) 1 mol of gas has a mass in kg equal to its molecular weight. 

(2) 1 mol of any perfect gas occupies the same volume V at a given p and T as 1 mol of any 
other perfect gas (e.g. the standard atmosphere condition, V = 22.4 m 3 at T = 0°C and 
p = 1 atmosphere). 

(3) 1 mol of a substance contains an Avogadro number of molecules, N A = 6.025 X 10 23 mol -1 . 

(4) Usually it is more convenient to quantify in kmol. 

Part (2) above tells us that the volume occupied by the same number of molecules of any gas is 
the same. 

The atomic masses of all elements can be found in the periodic table of the chemical elements. To 
get the molecular mass of a molecule made up of more than one atom, add the atomic masses of 
the atoms. For example, carbon dioxide is C0 2 , so for every single molecule of this there is one 
atom of carbon, atomic mass 12 kg/kmol, and two atoms of oxygen (indicated by subscript 2) of 
16 kg/kmol each, so the total molecular mass of C0 2 is 12 + 16 + 16 = 44 kg/kmol. 


Difference between mass and volume fractions 

Consider 1 kmol of hydrogen and 2 kmol of oxygen mixing 
without reacting in Figure 2.21, pictured by volume, at constant 
pressure and temperature. It is known that at the same p and T, 

1 kmol of any gas occupies the same volume - regardless of mass — 
as 1 kmol of any other gas. Since the molecular mass of hydrogen 
is so much less than oxygen, this shows us the effect of molecular 
weight, that vastly different volume and mass fractions of gases can 
occur in a mixture. 

The equation of state for perfect gas is: 

p t V(orpV,) = mRT 


h 2 


20 2 


H 2 + 20 2 

2 kg 

+ 

64 kg 


66 kg 

1 kmol 







2 kmol 





Figure 2.21 Proportions by mass compared to 
volume 


(2.25) 


To calculate by number of mole instead of by mass: 

n = where n is the number of kmol, m is the mass of the sample [kg] and M is molecular 
mass [kg/kmol]. Knowing that the specific gas constant, R is the molar gas constant R, 8,314 
[J/kmolK], divided by the molecular mass of the gas: 

mR= M = nR 

Substituting in equation (2.25) gives: 

Pi V(orpV t ) = nJiT (2.26) 


Molar analysis can be used to determine partial pressure and partial volumes and vice versa. 


Formula for the relation between partial pressure and partial volume and molar 
fraction. 

For the mixture as a whole, considered as a perfect gas, we have: 
pV= nRT 

(Note the molar form of the gas law). Using equation (2.26) gives: 

Pi v niRT 
pV nRT 

f = ? (2.27) 
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Combining with equation (2.24) leads to: 

P i= V i= n L 
p V n 

in which 


(2.28) 


2p,- = p, 2 V { = V and 2k,- = n 

Equation (2.28) shows us the important fact that volume fraction is equal to molar fraction. 
Consider a reaction 2H 2 + 0 2 —* 2H 2 O.The molar fraction of H 2 in the reactants is 2/3. This 
is also its volume fraction. Usually we have a volumetric analysis and a reaction to consider in 
combustion-related calculations. 


Formula for apparent molar mass of mixture, M 

This is calculated from 


and therefore 




by conservation of mass. 

Therefore the molar mass of the mixture is 

Compare this with the relation derived by partial pressure analysis: 

Apparent gas constant of mixture, R 

Since the specific gas constant is defined as: R = Jr 
Therefore, 

r = sr^ t h R 

R Z^n 

and 

J_ = R 

R Z^n 

Conversions between mass and volume fractions 

To convert from gravimetric to volumetric proportions, consider volumetric analysis: 
of constituent, i: 

niiR 

pVj = —~~T (equation A) 

of mixture: pV = T (equation B) 

dividing equation (2.32) by (2.33) gives ~ = (^) (|r) or ~ (^r) 

Note that pC; = nJi'F and k,- = ~Z. 

The conversion can be done in tabular form. 


(2.29) 


(2.30) 


(2.31) 


(2.32) 

(2.33) 

(2.34) 
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Example of conversion from volume fraction to mass fraction 

Typically, a volumetric analysis is stated as, for example, 10% methane, 10% ethane (C 2 H 6 ), 

Vi Hi 

65% nitrogen and 15% oxygen, i.e. -p = — = 0.1, 0.1, 0.65 and 0.15 respectively for each gas. 


Component 

Vj/V = rij/n 

rrij 

(n/ri) m i 

m, tij ~ i 

— = — nrij.-L 
m n 1 m 

A methane 

0.1 

16 

1.6 

0.058 

B ethane 

0.1 

30 

3.0 

0.109 

C nitrogen 

0.65 

28 

18.2 

0.659 

D oxygen 

0.15 

32 

4.8 

0.174 


m = y ]~ L ^~ L = 27.6kg/kmol 
Alternatively for mass to volume fraction: 


Component 

m/m 

rrij 

(nij/m)(Vm) 

n m rrij 

A carbon 

0.09 

12 

0.0075 

0.192 

bo 2 

0.21 

32 

0.0066 

0.169 

cn 2 

0.70 

28 

0.0250 

0.639 


4 = = 0.0391 kmol/kg 

For 0.1 kg of carbon combusting in 1 kg of air. Mass ratio of air is 0 2 = 0.233, N 2 = 0.767. 

Total mass is 1.1 kg. This is useful because C + 0 2 — » C0 2 , which we shall consider in the 
section on combustion. 


Work and heat transferred 

The equations given in this volume and in the first volume for the work done and heat 
transferred during various processes undergone by a single, perfect gas can be used for a 
mixture of perfect gases. That is, the appropriate values of specific heats and gas constants are 
calculated from the values applicable to individual constituents. Therefore knowing c v , c„, R and 
-y for the mixture, all other equations can be used. 


I 


Learning summary 

By the end of this section you will have learnt: 

/ gas mixtures have several gases intimately mixed, which can be quantified in proportion by mass 
or by volume; 

/ the Gibbs-Dalton law of partial pressures leads to proportion analysis by mass, or gravimetric 
analysis; 

/ Amagat’s law of partial volumes leads to analysis by volume proportions; 

/ combining the two with the gas law leads to a useful relationship between partial volumes, 

Pi Vi Hj 

molar proportions and partial pressures of the gases: — = — = — which can be used to make 
p V n 

conversions between gravimetric and volumetric analysis. 


2.4 Combustion 
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Fuels produce energy by reacting with oxidizers. The oxidizer is commonly the oxygen in 
atmospheric air but may be oxygen embedded in a compound that has high oxygen content, 
e.g. nitrates with N0 3 (i.e. one nitrogen atom per molecule with three oxygen atoms) as 
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seen in fireworks in Figure 2.22(a), which contain gunpowder: potassium 
nitrate (75% mass), charcoal (15% mass) and sulphur (10% mass). The fuel 
may he a solid (e.g. coal), a liquid (e.g. petrol, which may be considered as 
octane, C 8 H 18 , or a gas (e.g. natural gas, which may be considered as 
methane, CH 4 , as shown burning in Figure 2.22(b). Since fuels mainly 
comprise hydrogen and carbon, they are called hydrocarbons. If 
sufficient starting energy is provided to initiate a reaction, then the oxygen 
will react with the hydrocarbon fuel in combustion. This section outlines 
how to calculate the amount of air required to burn a given amount of 
fuel, and the composition of the products of combustion formed. 

Solid and liquid fuels are analysed by mass (gravimetric analysis). Gaseous 
fuels are analysed by volume (volumetric analysis). In this section, all fuels 
are assumed to be burnt in air, and it is assumed that air contains 23.3% 

0 2 , 76.7% N 2 by mass or 21 % 0 2 , 79% N 2 by volume as described in 
the introductory part of Section 2.2 on air conditioning. The ratio of 
nitrogen to oxygen by volume is -pp = 3. 76. The ratio by mass is 
= 3.29. These analyses correspond to a mean molar mass m of 28.85 kg/ 
kmol and a corresponding mean specific gas constant R of 0.287 kJ/kgK. 

If the small percentage of C0 2 and Ar are included, then m = 28.96 and 
R = 0.287 kj/kg K.This assumes approximate air data from tables 
(Rogers and Mayhew, 1995). 

Stoichiometric combustion 

Molar reaction equations are used for the analysis of a combustion 
process, which begins by the formulation of the chemical equation that 
shows how the atoms of the reactants are combined to form the products. 

How to balance equations: 
e.g. butane C 4 H 10 + x0 2 —• aC0 2 + bH 2 0 
The unknown quantities are labelled by the unknown number of moles for each, and the 
combustion of only 1 mole (if working in grams) or 1 kmol (if working in kg) of the fuel 
is considered. We know that since matter cannot be created or destroyed, the atoms on the 
reactant side must be present on the products side and so a count of atoms reveals that: 

C:4 = a 

H: 10 = 2b; b = 5 
0:2x = 2a + b;x= 13 
Therefore: 

C 4 H 10 + 130 2 4C0 2 + 5H 2 0 

By the law of conservation of mass, the number of atoms of each element is the same at the 
beginning as at the end of any chemical reaction: 

the number of atoms of reactants = the number of atoms of products 
The reaction equations are molar, i.e. in terms of molecules reacting, and therefore we can 
make use of molar masses to find the masses of each involved in the reaction, and molar 
quantities are equivalent to volume for gases as described in section 2.3 in the subsection on 
molar analysis. Equations conserve: 

(1) number of atoms of each element from reactants to products of combustion 

(2) mass of reactants = mass of products. 

Usually the number of moles (alternatively called mols) of reactants will not equal the number 
of mols of products. 



Figure 2.22 Combustion characteristics 


75 




An Introduction to Mechanical Engineering: Part 2 


In order to convert to mass fractions from mole fractions, use n = =>• m = nin as shown in 

the example below in which: 

n = number of kmol in sample 

»r = mass of sample 

in = molecular mass of the sample 


Example: forming water. 

h 2 + \ o 2 -»■ h 2 o 

lkmol + ^kmol — * lkmol 

1 kmol X 2 kg/kmol + jkmol X 32 kg/kmol -» 1 kmol X 18 kg/kmol 

2 kg + 16 kg —» 18 kg 


The proportions can then be scaled up or down depending on mass of fuel used. Similarly, if 
we have the masses, we can convert to molar equations. 


Note: Incidentally, at this point it is worth noting that the naming of hydrocarbon fuels 
gives away their chemical formula. The prefixes, meth-, eth-, prop-, but-, pent-, hex-, hept-, 
oct- etc. denote the number of carbon atoms in the molecules increasing from one to eight 
in this list. The suffixes -ane and -ene denote the bonding of the carbon atoms to each 
other within the molecules. A carbon atom can he considered as having four links with 
which to connect to other atoms. Therefore methane has one carbon atom with its four 
links connected one each to a hydrogen atom, hence CH 4 . Ethane has two carbon atoms 
with one link connected between them and then each one having three links connected 
to a hydrogen, hence C 2 H 6 . Ethene (Figure 2.23(a)) has two carbon atoms with two 
links connected between them - a stronger bond - and each with only two links each to 
hydrogen atoms, hence the C 2 H 4 . molecule.The schematic representation of butane 
(Figure 2.23(b)) gives the idea of the single carbon links and the single hydrogen links. 
Stoichiometric combustion describes a combustion reaction in which all of the fuel 
and oxidizer is used to make the products, so there is no leftover fuel or oxidizer. 

A combustion process can be: 


©H H 

I I 

c = c 

I I 

H H 


H H H H 


I I 

-c — c- 

I I 


H H H H 

Figure 2.23 Schematic diagram 
of the structure of (a) ethene 
and (b) butane 


Complete: where sufficient 0 2 is available to convert all carbon and all hydrogen in the 
hydrocarbon fuel to C0 2 and H 2 0. There may he excess oxygen in the products. 


Incomplete: where not enough 0 2 is available, and other products such as CO appear. 
Oxygen prefers to react with hydrogen, so all hydrogen is consumed before carbon starts to be 
converted to CO or C0 2 . 


A stoichiometric reaction is one where all the oxygen is used up and all the fuel is burnt to 
the ultimate products (C0 2 , H 2 0), which are the most stable combination of those atoms at 
atmospheric conditions. Stoichiometric combustion describes the most efficient use of oxygen 
and fuel, and it is useful to use the ratio of oxygen to fuel in the stoichiometric case in order 
to compare actual oxygen— fuel ratios. This can be done by volume or by mass. For example, if 
ethane is burned in oxygen, the equation is described by: 


C 2 H 6 + 3j 0 2 2 C0 2 + H 2 0 


The stoichiometric oxygen-fuel ratio of C 2 H 6 (by volume) is oxygen moles + fuel moles 

3.5 Vi «i 

= -j- - = 3.5 — i.e. by number of moles. This can be done because o£~y = i.e. molar 

proportions represent volume proportions. 


Air-fuel ratio 

Most combustion applications use the oxygen in atmospheric air and therefore draw in 
significant amounts of nitrogen to the reaction. Nitrogen is not reactive at atmospheric 
conditions, hut can react at the high temperatures typically found in flames. Although there 
can be reaction of nitrogen it does not happen in significant quantity compared to the more 
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aggressive carbon and hydrogen reactions, and can be neglected with only a very small effect 
on the volume and mass fractions of the other reactions. It is a significant issue for air pollution 
since parts per million are sufficient to cause acid rain. 

The stoichiometric air-fuel ratio (AFR) by volume is found from the stoichiometric oxygen- 
fuel ratio equation, but includes the nitrogen in the air as an inert gas. This is important not 
for the reaction, but for the temperature of the flame - the more gas there is involved in the 
reaction, which has to be heated up from atmospheric conditions to the flame temperature, the 
lower the flame temperature. 

Nitrogen is brought in with oxygen at a fixed ratio of: 

3.76 X (0 2 volume) — from approximate air data. 

Total air volume includes the 0 2 , i.e. air volume is (1 + 3.76) X (0 2 volume). 

The stoichiometric air-fuel ratio (AFR) in the case of the ethane combustion before is: 
AFR volume = (0 2 /fuel) vol ratio X 3.76 + 1 
= 3.5 X 4.76 = 16.7 (by volume) 

This can be converted to stoichiometric AFR by mass. 

Since n — zz 


In practice, the complete reaction equation for the air and ethane stoichiometric case will 
include N 2 terms where N 2 is treated as inert as in the following reaction equation: 


This can either be fuel rich (insufficient 0 2 ) or air rich (excess 0 2 ). In these cases, 
combustion products can contain incomplete products such as CO and free oxygen as well as 
C0 2 , H 2 0, N 2 etc. 

In the case of excess fuel the incomplete combustion of the fuel leads to C0 2 and CO 
products (this is in the case of hydrocarbon fuels). The hydrogen is always the first to burn off 
because it is a much more aggressive reaction than the carbon reactions, so it seizes any oxygen 
present before the carbon can get to it. In real processes, there may be both the oxygen (0 2 ) 
and CO in the products, because of incomplete mixing of fuel and air. 

In the case of excess 0 2 , the products will include free oxygen, but the fuel is burnt to the 
ultimate products. The % excess air supplied is a useful indicator for mixture quality. This is 
defined as: 



m c 2 n 6 


C 2 H 6 + 0 2 + 3^ X 3.76N 2 2C0 2 + 3H 2 0 + X 3.76N 2 

Non-stoichiometric combustion 


% excess air 



min air for stoichiometric combustion 


This is the same by volume and by mass because: 


by volume: — " " — v — - = — 


Vf Vf v m, m a 

— ± — n,,*- — - v — “ = — - 


because" 




.'. Multiply throughout by 



(2.35) 
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Wet and dry products 

Since the products of hydrocarbon combustion contain H 2 0, i.e. steam, which condenses 
when the product temperature is reduced to the atmospheric condition, it is useful to consider 
the products as wet when the water is contained as a vapour and dry when the vapour is 
considered to have condensed out of the products. This gives rise to the terms wet products 
and dry products of combustion. It also affects the energy value of the fuel, such that less 
energy is used when the products escape to the atmosphere containing the water as vapour. 
Notably, modern domestic condensing boilers make use of the energy in the water vapour by 
condensing it in the exhaust stream and capturing some of its energy. A gas analyser can be 
used to measure the component gases of a combustion exhaust stream in order to test how 
complete the combustion is, and it informs about the state of mixture of the incoming gases. 


Worked example 

Determine the stoichiometric AFR: 

(1) by mass (2) by volume 

for propane, C 3 H g . Calculate the volumetric composition by wet and dry 
analysis of the products. If the fuel is burnt with 30% excess air, what are t 
volumetric and gravimetric composition of wet products? 

Aim: 

To realize stoichiometric air-fuel ratio by mass and by volume. 

To realize the importance of wet and dry analysis of product gases. 


Stoichiometric air to fuel ratio (AFR): 

(1) by mass 

Reaction equation of the fuel: 

C 3 H 8 + 50 2 -* 3C0 2 + 4 H 2 0 
44 kg + 5 (32 kg) 3 (44 kg) + 4(1 8 kg) 

32 

The oxygen— fuel ratio by mass is therefore 5 X — = 3.636. The air— fuel r; 
worked out using the approximate ratio of air to oxygen in atmospheric ail 
1:0.233 by mass: 

air[kg] oxygen [kg] 1 
fuel[kg] fuel[kg] 0.233 
The AFR by mass is therefore 15.6. 

(2) by volume 

The oxygen— fuel ratio by volume is 5:1 from the reaction equation above. To find 
the AFR by volume, use the approximate ratio of air to oxygen in atmospheric air, 
which is 1:0.21 by volume: 

air[kmol] oxygen[kmol] 
fiiel[kmol] 


= 3.636 X 4.29 


0.21 


= 5 X 4.76 


fuel[kmol] 

The AFR by volume is therefore 23.8. 

The complete representative equation with all mixture components of the air-fuel 
stream includes nitrogen in the approximate ratio for atmospheric air. Including this 
in the reaction equation gives: 


C,Ho 


5 0 2 + 


-n 2 \ 


» 3C0 2 + 4 H 2 0 + 5 X 


-N 2 
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In order to get a volumetric analysis of wet and dry products, use a table for 
compactness. Wet includes the water in the reaction; dry does not include the water. 



Wet 


Dry 


Component 

Vi 

~V 

Vi 

Vi 

V 

co 2 

3 

0.116 

3 

0.138 

h 2 o 

4 

0.155 

0 

0 

n 2 

18.81 

0.729 

18.81 

0.862 

2 

25.81 

1 

21.81 

1 


When there is 30% excess air in the reacting stream of air and fuel, there will be 30% 
more 0 2 and N 2 in the representative equation: 

C 3 H 8 + 5 X 1.3(02 + °' 7 %.2i^) 3C0 2 + 4 H 2 0 + 0.30 2 + 5 X 1.3 X °- 7 % l21 N 2 

The list of products can be made for gravimetric and volumetric analysis: 


Component 

V 

Yi 

m 

~ v i 

m i ~ m ‘ V 

ntj rhi Vi 

m~ m V 


[kmol] 

H 

[kg/kmol] 

[kg/kmol] 


co 2 

3 

0.091 

44 

4 

0.141 

h 2 o 

4 

0.122 

18 

2.187 

0.077 

o 2 

0.3 

0.046 

32 

1.456 

0.051 

n 2 

24.45 

0.742 

28 

20.78 

0.731 

2 

32.95 

1 


2 = m = 28.4 

1 


Note that the column for the individual mixture species components has units kg/kmol 
and not just kg. This happens because, when multiplying by the volume fraction, we are 
considering per unit volume, and that is equivalent to per kmol of the total product gases. 
The sum of this column strictly speaking makes up the molecular mass of the product 
gases as an incidental by-product because we use the kmol as a convenient total sample 
size of the product gases. The final column calculating the ratio of each gas to the total 
mass of product gases uses the molecular mass of the previous column. 


Worked example 

Benzene C 6 H 6 is burned in air and dry products of 
combustion contain 2% CO by volume and no free 
oxygen. Determine the stoichiometric air-fuel ratio 
by mass and the actual air ratio by mass. 

Aim: 

To calculate excess air supplied. 

To show how to use the air— fuel ratio to indicate 
substoichiometric combustion. 

The benzene molecule is complex, in the form of 
a ring with a mixture of double and single links 
between the carbon, as shown. 



Figure 2.24 Schematic 
diagram of the 
structure of benzene 


► 
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For the stoichiometric combustion case, the reaction of the fuel with oxygen can be 
written: 

C 3 H 6 + l\o 2 -* 6 C0 2 + 3 H 2 0 
78 kg + 7y(32 kg) - 6(44 kg) + 3(18 kg) 

The oxygen-fiiel ratio by mass is therefore l\ X 32:78 = 3.077. 

The AFR by mass can be calculated using the approximate ratio of air to oxygen in 
atmospheric air by mass, which is 1:0.233 = 4.29: 

= ox ygen[kg] air [kg] 

fuel[kg] fuel[kg] oxygen[kg] 


The AFR by mass is therefore 13.2. 

For the actual combustion, insufficient oxygen is supplied to burn the fuel completely. 

We assume that the order of reaction of the atoms in the fuel is hydrogen first completely, 
since it is the fastest and by far the most aggressive. Subsequent to this, the carbon burns, 
firstly all to carbon monoxide and then some CO reacts to form C0 2 to consume the 
remaining oxygen. The lack of sufficient air therefore leads to some CO remaining in the 
exhaust gas stream. 

We assume in the representative reaction that there is a proportion of the stoichiometric 
0 2 required, and write: 

C 6 H 6 + y(0 2 + °- 79 / 0 . 21 N 2 ) -* aC0 2 + bCO + cH 2 0 + 0 - 79 / 0 . 21 JV 2 

The first step is to conduct an atomic balance in terms of y, such that when the proportion 
y is known, the equation balances identical quantities of elements on each side. 


Carbon: 6 = a + b 
Hydrogen: 6 = 2c, i.e. c = 3. 

Oxygen: 2y = 2a + b + c and substituting c, and 

a = 6 - b, 2y = 12 - 2b + b + 3 or b = 15 - 2y. 


The second piece of information is the proportion of CO in the dry products by volume, 
which is 2%. We therefore create the proportion from the above equation and equate to 


0.02: 


And substituting b = 2y 


a + b + 3.76y 
15 and a + b = 6: 

15 - 2y 
6 + 3.76y “ ( 


Therefore, y = (15 -0.12)/(2 + 0.0752) = 7.16. 

The AFR by mass for the actual combustion process is: 

air[kg] 7.16 X (32kg/kmol + 3.76 X 28kg/kmol) 983 
fuel[kg] = 78 kg = "78~ 


■ Worked example 

A particular coal has a mass analysis of 81% C, 5% H, 5% O, 9% ash. The 

I volumetric analysis of the dry products is: 

10% C0 2 , 1% CO, 8% 0 2 , 81% N 2 

Determine the minimum air required for complete combustion of 1 kg of coal 
and the percentage of excess air required to produce the product gas proportions. 
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Aims: 

To realize the ash content effect on the air-fuel ratio. 

To determine the stoichiometric reaction. 

To realize the effect of oxygen in coal on the stoichiometric reaction. 
To calculate excess air supplied. 


In this case it is useful to determine the required oxygen by working out for each 
component how much oxygen is required to burn it individually. So instead of a 
representative reaction equation, it is more obvious to create a table: 


Constituent 

Reaction 

Mass per kg 
of coal 

Mass of 0 2 
per kg 

Oxygen 

required 

carbon 

c + o 2 -*> co 2 

0.81 

32/12 = 2.67 

2.16 

hydrogen 

h 2 + %o 2 h 2 o 

0.05 

16/2 = 8 

0.4 

oxygen 


0.05 

-1 

-0.05 

ash 


0.09 

0 

0 


Adding the oxygen required in the last column, per kg of coal there is 2.51 kg of 
0 2 required. For complete combustion, the air required for 1 kg of coal is this mass 
multiplied by the air:oxygen ratio by mass in the atmosphere, which is 1:0.233 = 4.29. 

Therefore complete combustion requires 4.29 X 2.51 = 10.77 kg. 

In order to find the actual air supplied, we have to take account of the information presented 
by the constituents of the product gas stream. This contains nitrogen that can only have come 
from the air supplied. The carbon came in a fixed proportion of the coal, and if we can work 
out the nitrogen to carbon ratio, we can infer the aincarbon ratio, and hence the airxoal 
ratio. The other useful information is that the ratio of products is given by volume, and we 
therefore know the molar proportions, which then give not only the mass proportions of 
each species molecule, but also the mass proportions of the species atoms. 

For 1 kmol of product gases, there is 0.81 kmol of N 2 , with mass 0.81 X 28 = 22.68 kg. 
This is carried in with a proportion of air by mass of 1:0.767 or = 29.57 kg. 

There is 0. 1 kmol of C0 2 , containing 0.1 kmol of carbon, with mass 0.1 X 12 = 1.2 kg, 
and there is 0.01 kmol of CO, containing 0.01 kmol of carbon, with mass 0.01 X 12 = 
0.12 kg. The total amount of carbon in 1 kmol of product gas is therefore 1.32 kg, which 
is carried in the proportion 1:0.81 of coal to carbon. Therefore the mass of coal to carry 
1 32 

1.32 kg of carbon is ^ = 1-63 kg. 

The air to coal ratio is therefore 29.57:1.63 = 18.14 by mass. 


Therefore percentage excess air is: 

excess air % = 
or 68.4%. 


18.14 - 10.77 
10.77 


How solids burn 

Coal, plastics and wood all burn in an oxygen environment, and the pattern of combustion is 
very familiar if not formally recognized. The combustion has characteristic stages as follows: 
initial heating with evaporation of any moisture content up to the temperature at which 
polymers melt and begin to evaporate. This stage is called devolatilization involving the 
evaporation of volatiles - low boiling point hydrocarbons in the fuel. The volatile molecules 
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burn in the surrounding gas giving the familiar flame of airborne combustion. There then 
follows smouldering combustion of the char content that is left behind, until only the ash 
is left. Char is carbon, which is solid, does not evaporate and relies on diffusion of oxygen 
coming in through the pores to burn. As long as the diffusion rate is sufficiently fast to produce 
a reasonable amount of combustion, whilst insufficient cooling occurs, then the char remains 
at high temperature sufficient to initiate combustion at new sites penetrating deeper into 
the body. This type of solid-bound combustion has a characteristic glow dependent on the 
temperature of the char. Figure 2.25 shows the early stages of combustion of a piece of rubber, 
showing a sooty flame as volatile molecules are released and burnt, and some are cooled faster 
than they are burnt, forming carbon-heavy particles or soot. 



o o © o 


Figure 2.25 Initial stage of combustion of a (a) ignition; (b) 30s; (c) 60s; (d) 120s piece of car tyre rubber in a 
900°C furnace 

Combustion energy 

This section describes how to calculate the energy released by combustion, and the 
temperature of the products. The temperature affects the mechanical strengths of materials 
of the combustion vessel as well as thermal efficiency. Combustion, just like other processes 
considered before, can be a closed system process as in an internal combustion engine, or 
an open process as in a jet engine or boiler furnace. As before, it is the first law that we are 
interested in. The two forms are the direct first law and the derived SFEE for each of the 
processes. Combustion releases chemical energy, which is converted to internal energy in the 
closed system and enthalpy in the open system. 

Internal energy (A U 0 ) of combustion 

The closed system is the easiest to consider to start with, that is for non-flow processes. To use 
the first law of thermodynamics, Q + FF = A U, we need to know, when energy is released from 
combustion, how much is converted to heat and work, and how much is taken to increase the 
internal energy of the working fluid. A(7 0 is the internal energy of combustion at reference 
conditions of temperature T 0 (usually 25°C) and of pressure p e (usually 1 bar). This is determined 
from the first law applied to a constant volume, closed system process, as shown in the following. 

For an analysis of closed system combustion, refer to schematic diagram of the process in 
Figure 2.26. 


V 0 ,T 0 


(1) 

Reactants 

mixture 



(2) 

Combustion 



(3) 

Cooling 


Figure 2.26 Schematic diagram of combustion in a closed vessel 


(4) 

Products 
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(1) Reactants are brought together; at V (h T 0 , the fuel and air mixture is ignited and burnt. 

(2) Combustion then takes place; this releases chemical energy which is transformed into 
‘sensible’ internal energy, that is kinetic energy of the molecules, which dictates the 
temperature of the mixture. Some changes to properties will also occur and the chemical 
species will change. Still atV 0 , but temperature will have increased. 

(3) and (4) Products are cooled back to T 0 . Notice that bringing the products back to 

temperature T 0 is a standardizing procedure - it allows us to consider, in a standard way, the 
quantity of heat available from a combustion reaction. 

Apply the first law to the processes (a) to (c): 

Q = - W+ U c ~ U A (2.36) 

But there is no work because it is a constant volume process. 

Q=U p0 ~ U, o (2.37) 

where subscript p indicates product species mixture and r indicates reactant species mixture. 

Q = U p0 - = A U 0 (2.38) 

Since Q is negative, because heat is transferred from the system to surroundings, then 
L 7 , 0 > UpQ and the internal energy of combustion is negative. Internal energy of combustion 
tells us how much heat had to be removed to convert the reactants to the products and bring 
the products back to the standard state temperature. So Q in equation (2.38) is the heat released 
by the reaction. 


Enthalpy (A H 0 ) of combustion 

For steady flow process combustion, the situation is similar except the steady flow energy 
equation Q + W = A H (neglecting KE and PE) is used. In the same way as for the closed 
system, in which a standard state internal energy of combustion is used, in the open system, 
AH (I , the enthalpy of combustion at reference temperature T 0 (25°C) and pressure p 0 (1 bar) 
is used. Note the subscript zero, means at standard condition, i.e. at 25°C. 


Analysis of open system combustion 

A H 0 can be determined from the SFEE for a flow 
process with no work, and with heat transfer to 
restore the temperature of the products to initial 
temperature of the reactants, T 0 . Consider the 
situation shown in Figure 2.27, in which the 
combustion takes place from the standard state 
condition at 1 (i.e. temperature isT 0 ), releasing 
energy. The combustion energy released in 
the flame is removed through the walls of the 
system by heat transfer in order to reduce the hot 
combustion product gases to the initial temperature 
of the reactants. Writing the steady flow energy 
equation including kinetic energy for now: 

Q 2 (2 2 

H x + m~ + Q(-ve) = H 2 + m y- 




' J 

2 

Reactants 1 

Ifr- 1 

Combustion 

— ► 


T 0 

T 0 


Figure 2.27 Open system combustion model. Note that 
the beginning and end conditions are forced to be at the 
standard state. 


Q=(H 2 - HO + 


(2.39) 


An Introduction to Mechanical Engineering: Part 2 

The kinetic energy terms are often negligibly small compared to the enthalpy and equation 
(2.39) simphfies to 

Q = H Po - H Ro = A H Q (2.40) 

For a combustion reaction, Q is negative, i.e. the system produces heat that is extracted from 
the working fluid via the containing system walls, in order to bring the products back to the 
standard condition temperature. Since the flow is in a duct of some kind, it is often the case 
that pressure does not change significantly across the flame. 

Converting between A H 0 and A U Q from H = U + pV 

Values of enthalpy of reaction are stated in data tables, but internal energy of reaction is not 
necessarily stated. If we want to know the internal energy of reaction for a closed process, then 
we must be able to work it out. It is therefore useful to be able to convert from one to the 
other, and this is based on the formula for enthalpy as follows: 

A H 0 = H p o - H ro = (AU p0 + A Pp0 V p0 ) - (AL7 ro + p R0 F R0 ) 

A H 0 =(A U p0 -U K0 ) + (Ajy, VpQ — p R0 F ro ) 

A H 0 = A U 0 + (Ap p0 V p0 - Pm F ro ) (2.41) 

— For solid and liquid components, PV is negligible 

— For perfect gas components, p V = nRT, therefore 

where A JT 0 = AU 0 + (A n p - Mr) R t0 (2.42) 

n p = kmols of products per kmol of fuel burnt. 

Mr = kmols of reactants per kmol of fuel burnt. 

A H 0 and A U 0 are in kj/kmol of fuel burnt. 

Usually energy contributions are considered in terms of mass, and so conversion from kj/kmol 
to kj/kg must be done before use in the steady flow energy equation. Enthalpy of reaction 
is stated for common reactions in the data tables. There is a very easy way to work them out 
from enthalpy of formation for the molecules involved in the reaction which are given in 
the tables in order to determine reaction enthalpy for a wider range of reactions. Formation 
enthalpies are stated in kj/kmol for molecules, and represent the energy required to form the 
molecule from the constituent atoms. Atoms have formation enthalpies dependent on their 
structure, but are often zero. In particular, all diatomic molecules (e.g. N 2 , 0 2 ) have formation 
enthalpies of zero. In order to calculate the energy released by a chemical reaction, write the 
reaction down, write the formation enthalpy of each molecule below the reaction equation, 
multiplying by the number of moles involved in the reaction. The difference between the 
sum of reactant formation enthalpies and the sum of the product formation enthalpies is the 
reaction energy. For example considering the previous example of propane: 

C 3 H 8 + 50 2 -* 3C0 2 + 4H 2 0 
44 kg + 5 (32 kg) 3 (44 kg) + (18 kg) 

-118 910 + 5 X 0 ^3 X -393 520 + 4 X -241830 

The enthalpy on the reactant side is -118 910 kj and the enthalpy of the products is -2 147 880 kj. 
Therefore for the combustion of 1 kmol of propane, the reaction energy released is 
-2 147 880 -(-118,910) = -2 028 970 kj 
Where formation enthalpies are taken from data tables in kj/kmol. 

This is a lot of energy, but it is important to remember that it comes from 1 kmol of propane, 
which weighs 3X12 kg + 8X1 kg = 44 kg, which is a substantial amount of fuel. 
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Calorific values 

A h 0 and A u 0 are usually given at 25°C, per kg or kmol (h) of burnt fuel. In the tables, A h Q is 
given and stated as A/i° kj/kmol at T = 25°C.The phase of the reactants and products, which 
could be vapour or liquid (e.g. H 2 0) should be specified. Alternatively, calorific values are 
quoted instead of A h 0 and A u 0 . These are defined according to British Standard. 

Gross (or higher) CV at constant volume (Q Ar> „), A U 2 5 °c> with HiO in products as liquid. 

Net (or lower) CV at constant volume (Q„ e( „), AL 7 25 o C , with H 2 0 in products as vapour. 
Gross (or higher) CV at constant pressure (Q A , rp ) AH 25 . c , with H 2 0 in products as liquid. 

Net (or lower) CV at constant pressure (Q„,., jP ) AH 25 . c , with H 2 0 in products as vapour. 

By convention, calorific values are stated as positive values. However, we know that according to 
the European first law convention, positive energy transfer is into the system. Since a combustion 
reaction produces heat out of the system, combustion energy is indicated as negative for use in 
the first law equations. The difference between gross and net is due to the state of the water in the 
product gases. Look at tables of formation enthalpies (Rogers and Mayhew, 1995): H 2 0 (liquid) 
is —285 820 kj/kg and H 2 0 (vapour) is —241 830kJ/kg.The difference is the enthalpy difference 
between liquid and vapour water at 25°C.This is the contribution of the H 2 0 content to the 
overall product enthalpy. Hence the higher calorific value (i.e the more negative the heat removed 
to restore to the standard temperature) is with the water as liquid because it reclaims the energy of 
condensation, which in turn must be removed from the system by negative heat transfer. 

Gas mixture conditions when inlet and outlet 
are not at standard conditions 

Consider the closed system combustion, similar to the situation in Figure 2.26, but this time 
with beginning and end conditions that are different from the standard state. If initial and final 
states are not at the reference state temperature T 0 , then artificial processes are considered in 
order to adjust conditions such that combustion takes place at the reference temperature T 0 . 

The reactants and products are assumed to be perfect gases (plus liquids and solids) so that h and 
u depend on T and only T. With these assumptions, we can use A u = c„AT and A h = c p A'F 
for gas mixtures heating and cooling, or more accurately the absolute values of h and u directly 
from tables, to calculate the difference in enthalpy and internal energy between reference and 
actual states. Values of the molar enthalpy or energy content (note that now it is just ‘enthalpy’ 
not ‘formation enthalpy’ or ‘combustion enthalpy’) and the molar internal energy are given for 
common gases in the tables over a large range of temperatures, so they can be read from there. 
The specific values, in order to consider fractions by mass rather than volume, are calculated by 
dividing by the molecular mass, e.g. h [kj/kmol ]/m [kg/kmol] = h [kj/kg]. 


Worked example 

What is the specific enthalpy change for carbon dioxide between 25°C and 327°C? 

For C0 2 molar enthalpy at 327°C or 600K is 12 916kJ/kmol. At 25°C or 298.15K it is zero. 

So h 60 ok — ^ 298.1 5k = 12 916 kj/kmol, and since m C02 is 44 kg/kmol, specific enthalpy 
change is /i 600K - ^ 298 K = 293 kj/kg. 

The alternative method of computation is to use specific heat capacity. c p = 0.846 kJ/kgK 
at 300K and c p = 1.075 kJ/kgK at 600K. 

K ook— 298K = mc p AT = 1 X ((0.846 + 1.075)/2) X 300 = 288.15 kj/kg. 

The first method is more accurate since it makes no assumptions. The second method 
assumes that the change of c p with temperature is linear — it is not, quite. 
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Closed system combustion 

Having determined that internal energy and enthalpy changes of gases between different 
temperatures can be determined, we can now return to the question at hand, which is to 
determine the enthalpy difference (i.e. the internal energy difference for the closed system) 
from the standard state for the beginning and end states of combustion. Figure 2.28 shows 
a schematic of combustion in a closed cylinder with a moveable piston for extracting work, 
similar to an internal combustion engine. In the analysis of the combustion, we know what the 
energy release is at standard conditions, so our analysis breaks the process from state 1 to state 2 
into a number of artificial steps: 

(1) start with the gas at the initial conditions V\, 7\; 

(2) make the gas go from the initial conditions v x , T x to the standard conditions u 0 , T 0 and 
note the heat and work to do this; 

(3) as in the analysis for Figure 2.24, combust; 

(4) as in analysis for Figure 2.24, cool back to T 0 noting the energy release to do so; 

(5) make u () T 0 , standard conditions go to final conditions v 2 , T 2 noting the heat and work to 
do this. 


Products 



Figure 2.28 Closed system combustion case with conditions at beginning 
and end that are different from the standard state. A cylinder is depicted 
similar to an internal combustion engine, and three intermediate artificial 
conditions are inserted to aid in the analysis of the process. 


Note that there are only two real states: at the beginning conditions 1, and at the end 
conditions 2. Three intermediate artificial conditions are inserted between the beginning 
and end states to simplify the analysis because we know what happens with energy release 
at standard conditions. This assumes that we know the start and end conditions and wish to 
know the energy released. 

From applying the first law to the states (1) to (2) 

Q12 + W-12 = U 2 -U 1 but 
U 2 ~U t = Up 2 - U K1 

U 2 -U x = (U p2 - Upo) - (U R1 - U K0 ) + (U p o - U R0 ) 

u 2 - U x = m MT 2 -To)-J2 “ r o) + Au o (2.43) 

PRODUCTS REACTANTS 


Where U, with capital, indicates extensive internal energy for the whole quantity of gases 
considered. 
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Or with tables for values of internal energy: 

U 2 ~ u i = ^ m ' ( ' u ' 2 ~ w '°) m >( Mil _ M >'°) + m &eA«o (2-44) 

PRODUCTS REACTANTS 

Note that internal energy does not depend on pressure for a perfect gas. Given the conditions 
that occur between states 1 and 2, heat and work exchanges with the surroundings can then 
be calculated. For example, if we know that the system is perfectly insulated, then the energy 
change must be the work done by the surroundings on the working fluid; alternatively, if 
there is no machinery to extract work then the energy must be exchanged as heat with the 
surroundings. 

Open system combustion 


For an open system case 
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Figure 2.29 Open system combustion schematic in which the 
beginning and end states are at the standard condition 


The open system can be handled in the same way as the closed system: apply the same idea as 
for the closed system of bringing reactants to the standard state T 0 , and the products returned 
from T 0 to actual state 2. 

Apply the steady flow energy equation, noting the superscript dots to indicate flow rates: 

Q 2 c 2 

Hi + nrf- + Q12 = ~W l2 + H 2 + m -y- 

Qi 2 + = (H 2 - Hi) + m^f- - tf- 

The kinetic energy is negligible compared to the enthalpy terms: 

Ql2 + W u =H 2 -Hi 

Note that condition 2 is for the products and condition 1 is for the reactants: 

Q12 + = H V2 - H K i 

Now, similar to the treatment of the closed system case, reduce the product condition to 
standard state, and the same for the reactants: 

Q12 + Wi 2 = (Hp 2 - Hpo) - (H R1 - H ro ) + (H P0 - H ko ) 

Note now that the last enthalpy difference term is the combustion enthalpy at standard 
conditions, and using the expression for enthalpy from specific heat capacity at constant 
pressure: 

Q12 + ^12 = 2m lV <T 2 - T t ) - Zm t c pi (T, - T 0 ) + A H 0 
Alternatively, use the enthalpy of the gases from tables for a more accurate solution, which gives: 
Q12 + W u = I.m { {h a ~ h i0 ) - 'Zmihn - h i0 ) + A H 0 (2.45) 
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Worked example 

A mixture of CO and air is 10% fuel rich by volume and at 8.28 bar and 
555K. The mixture burns adiabatically and at constant volume. Calculate the 
products temperature neglecting dissociation. 


The representative stoichiometric reaction equation gives the ideal reaction, which shows 
the required oxygen and nitrogen to thoroughly combust all the fuel: 

CO + tt 0 2 + I X 3.76N 2 -* C0 2 +|x 3.76N 2 

We know that the reaction is not stoichiometric, because we are told that the mixture is 
10% rich - i.e. there is 10% more CO than can be burnt with the stoichiometric air in 
the above equation. Therefore, the true representative reaction equation will have 1.1 CO, 
thus: 

1.1CO + \o 2 + \ X 3.76N 2 -* C0 2 + | X 3.76N 2 + O.ICO 
Alternatively, divide by 1.1 to get the reaction in terms of combustion of 1 kmol of fuel: 
CO + 0.91(|0 2 +{x 3.76N 2 ) 0.9CO 2 + 1 71N 2 + O.OCO 

In order to determine the final temperature, we need to analyse the closed system with 
£ the method described above: 

u 2 - Ui = - TO - m icv i(T 2 - TO + AU 0 

PRODUCTS REACTANTS 

a The internal energy of combustion can be obtained from the tabulated values of enthalpy 
of combustion and the equation to convert between the two: 

A H 0 = A U 0 + A (n p - »r) RT 0 

A k 0 for CO is —283 OOOkJ/kmol for the reaction CO + y0 2 — > C0 2 . 

Aw 0 = A h 0 - (n p - n^RT 
A u 0 = -283 000 - (1 - 1.5)8.314 X 298 
Therefore the internal energy of combustion per kmol of CO is -281 761 kj/kmol. 

To put this in specific terms, divide by the molecular mass of CO, 28 kg/kmol, to give 
-10 063 kj/kg. 

The first law can now he applied, for adiabatic (Q = 0) and constant volume {W = 0) to 
give: 

U 2 -Ui= 0 

Where U 2 is the total internal energy of the products at the end of the process and is 
the total internal energy of the reactants before combustion. Knowing the temperature 
of the reactants, we can work out the internal energy of the reactants before combustion. 
We don’t know the final temperature of the products - this is what is asked for. Using 
equation (2.44): 


u 2 - t/i = J2 m ‘ c ^ T2 ~ To) ~Y1 ■ T ° } + Au ° 

PRODUCTS REACTANTS 
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And reading values of molar internal energy from the tables (Rogers and Mayhew, 1995), 
to complete the following table of reactants: 


Species 

ttj kmol 

M i,555K kj/kmol 

M l M «, 555 K kj/kmol 

CO 

1 

2984 

2984 

o 2 

0.455 

3233 

1471 

N 2 

1.71 

2944 

5034 

2 



9489 


where the internal energy at 555K is found from the tables by interpolation. Values are 
stated at 400K and 600K for a number of gaseous species. The interpolation is done thus: 
~ _ 555 - 400 ~ ~ ~ 

«/,555K- 60Q _ 4 o 0 U9,600K «,,400k1 + «,'400K 

The reader can easily confirm that the values in the table are correct. 

There is no direct way to obtain the product temperature since it is determined by 
balancing the equation. The method used is to guess a temperature and see how far out of 
balance the equation is. In this case, guess 3000K. 

Making the same table as above for the products 


Species 

n j kmol 

m ;,3000k kj/kmol 

n i u ;,3000K kj /kmol 

co 2 

0.91 

127 920 

116 407 

CO 

0.09 

68 598 

6174 

n 2 

1.71 

67795 

115 929 

2 



238 510 


Inserting the values in the equation: 

U 2 -Ui = 238 510 - 9489 + (-281 761) = -52 740 kj 
This is negative, and shows that the products were not hot enough. Therefore try the next 
temperature up in the table, i.e. 3200K: 


Species 

M; kmol 

««,3200K kj/kmol 

«;»;,3200K kj/kmol 

co 2 

0.91 

138720 

126235 

CO 

0.09 

74 391 

6695 

n 2 

1.71 

73 555 

125 779 

2 



258 709 


And the equation now is 

U 2 - U, = 258709 - 9489 + (-281 761) = - 32 540 kj 
This is still too cool so the process is repeated until the answer is acceptably close to zero. 

Learning summary 

By the end of this section you will have learnt: 

/ combustion involves mixing a fuel (hydrocarbon) and an oxidizer, in order to produce heat whilst 
converting the chemicals involved in the reaction into reaction products; 

/ in this unit, the hydrocarbon fuels considered only have hydrogen and carbon composition, and 
produce the ultimate product gases carbon dioxide and water vapour; 

/ molecular reaction equations can be balanced by counting the atoms of all molecules on the 
reacting side of the equation, and matching the number of atoms of all molecules on the product 
side of the equation; 
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/ a stoichiometric combustion reaction is ideal: every carbon atom combines with exactly one 
diatomic oxygen molecule to produce a carbon dioxide molecule, and every hydrogen atom 
combines with one other hydrogen atom and one oxygen atom to produce water vapour; 

/ when there is either too much or too little oxygen, incomplete products of reaction appear 
including carbon monoxide and oxygen. The reaction is then non-stoichiometric; 

/ most practical combustion installations use atmospheric air, which carries oxygen as the primary 
oxidizer. The oxygen in the air brings with it a fixed proportion of nitrogen. The combined 
requirement for oxygen with its associated nitrogen gives an air requirement that leads to the air- 
to-fuel ratio by mass or by volume; 

/ for non-stoichiometric combustion, the excess air ratio is defined as the ratio of excess air 
supplied to the air required for stoichiometric reaction; 

/ the product gases may be considered by proportion with or without moisture, since the moisture 
will condense out of the gases when they cool down. The alternatives are known as wet and dry 
products of combustion; 

/ each reaction has an enthalpy of reaction at standard conditions, which is determined by the 
difference between the sum of formation enthalpies of the reactants and products; 

/ standard conditions are 1 atmosphere pressure and 25°C; 

/ the enthalpy of reaction is referred to as the calorific value of the fuel, and may be considered as 
gross, in which case the water is considered as condensed out of the gases, or as net, in which 
case the water is considered as vapour in the gases; 

/ enthalpy of reaction is directly used in flow process combustion but must be converted to internal 
energy of reaction for closed process combustion; 

/ the final temperature of a combustion process can be determined by an enthalpy balance (or 
internal energy balance for closed processes). 


HP piston- 


2.5 Reciprocating compressors 

Reciprocating piston machines are typically needed to provide a supply of compressed air, 
but are also used for other positive displacement requirements such as refrigerator pumps and 
vacuum pumps. They can be described as single-acting or double-acting (compression on 
one or both sides of piston), single-stage or 
multistage (number of stages of compression 
before air is deHvered).They are positive 
displacement pumps and usually have self- 
acting valves (cylinder inlet and outlet valves 
needed to prevent backflow, which are opened 
and closed by pressure differentials across 
the valves). A typical example can be seen in 
Figure 2. 30. The range of sizes and pressures is 
from the order of millimetres of water gauge 
pressures (e.g. 5 mm water gauge is p = pg h 
= 1000 [kg/m 3 ] X 9.81 [m/s 2 ] X 0.005 [m] 

= 49.05 Pa), as might be used for medical 
purposes, to several meters of water gauge 
pressure (e.g. 20 m water gauge is p = pg h = 

1000 X 9.81 X 20 = 196 200 Pa or 1.9 bar 
gauge or approximately 2.9 bar absolute) and 
up to hundreds of bar pressure, as might be 

used for compressing carbon dioxide to be Figure 2.30 Reciprocating air compressor of two- 

buried in disused oil and gas wells for carbon stage type, with a big lower pressure piston, and a 

dioxide sequestration. smaller, higher pressure piston 
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There is some basic terminology that must be remembered for a proper discussion of 
reciprocating compressors to describe their operation fully. These descriptions are related to 
Figure 2.31: 

stroke - distance from bottom dead centre (BDC) to top dead centre (TDC) 

clearance volume — approximately 10 per cent of swept volume allows the valves to be clear 

of the piston 

swept volume — volume swept through from BDC to TDC 

machine cycle - the gas goes through a machine cycle rather than a full thermodynamic 
cycle because it is drawn in and exhausted part way through the cycle 
automatic valves - spring return valves 



Figure 2.31 Schematic of reciprocating air compressor stage 
with inlet and outlet flap valves and piston with crank 


The gas in the compressor does not undergo a true 
thermodynamic cycle according to the definition of being 
taken around a series of states and returned to its initial 
condition. It is a machine cycle, since the same mass is 
not taken around the cycle, but rather has its state changed 
from state 1 to 2 in Figure 2.32, whereupon it is delivered 
to further processes separate from the compressor. The 
indicator diagram is a pressure— volume diagram of 
the processes occurring in the compressor (Figure 2.32), 
which describes the progression of the gases through the 
compressor from inlet at pressure p\ to exhaust at p 2 . These 
diagrams are commonly produced mechanically from 
compressors in order to establish the actual performance, 
comparing instantaneous pressure in the cylinder with 
the position of the piston. There is a sequence of events 
operating on different quantities of the working fluid: 



Figure 2.32 Machine cycle - alternatively called an 
indicator diagram - of a single stage of a reciprocating 
air compressor showing valve bounce 


(1) Some time after TDC, the intake valve opens at position 4 when pressure in the cylinder 
has dropped to p\ and the air over the cylinder inlet is able to force its way through the flap 
valve. 

(2) As the piston continues travelling downwards, air is drawn in until the BDC at position 1; 

(3) The piston continues in its cycle now heading towards the top of the cylinder with the gas 
inside the cylinder trapped by the two valves, and compressing according to a polytropic 
process. 

(4) Delivery (exhaust) occurs when the piston has moved some way towards TDC at which 
point the pressure in the cylinder just overcomes the pressure p 2 and the flap valve opens. 

(5) The piston continues to TDC driving air out into the receiving chamber at the higher 
pressure. 
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(6) The piston reaches TDC, leaving a small volume to clear the valve movements and a small 
volume of gas at the higher pressure is trapped inside the cylinder. 

(7) This volume of gas is expanded in a polytropic process to the lower pressure and the cycle 
repeats. 

If the polytropic processes occur rapidly, they are near adiabatic; if slowly, then they are near 
isothermal. Hence in the middle must be polytropic: pv n = constant, and 1 =£ = w =£ = y. 
Typically, the value of n for air would be 1.3 (y = 1.4). The ideal (indicated in the diagram) 
and actual indicator diagrams differ because of imperfect valve operation, leakage past the 
piston, non-constant n during compression and expansion, and valve bounce. Valve bounce can 
often cause pressure oscillations on the ‘constant pressure’ processes, as the valve bounces on its 
springy mounting as indicated in Figure 2.33 


Spring 



Figure 2.33 Typical arrangements of self- 
operating valves 


Work done during compression 

The cost of running a compressor is in the work done, or rather the power requirement, to 
drive the machine cycle and deliver the required mass flow of the gas. There is an ideal case, 
and an actual work done, and also the case of the minimum that is thermodynamically possible, 
which are all covered in this section. 

Reversible (ideal) work done (by compressor on gas, i.e. positive) for equivalent steady flow 
open system is: 

For an open system, reversible work done is 

dw = fldp (2.46) 

therefore for a polytropic process 

w = - Pi v i) or w = -^zrjR(T 2 - T t ) 

w is specific work (work done per kg of working fluid) for a polytropic steady flow process. 

The overall work is therefore: 


W is the work r; 
Using 


■ n (P2 V 2\ 

W= mp-iv-* — 1 

1 1 » - 1 \PlVij 

r power required, for total mass flow rate of air. 


Equation (2.47) becomes: 


(B -i » (g) 


(2.47) 


(2.48) 


(2.49) 
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This is ideal in the sense that it takes no consideration of the practical workings of the air 
compressor, which will involve mechanical work and so reduce the efficiency of the air 
compression, i.e. there will he losses due to mechanical inefficiency. 


Less work is better for a compressor — it is the effort required to deliver 
the air to the higher pressure. The magnitude of the work decreases as 
n decreases — the limiting point is n = 1, i.e. isothermal as depicted in 
Figure 2. 34. This is the minimum work to compress the air to the higher 
pressure and therefore would be the ideal if it were practically possible. The 
reversible polytropic index work is the actual ideal work done — with the 
polytropic index — somewhere between adiabatic and isothermal. Actual 
implies according to a realistic process, ideal implies that no other losses are 
accounted for such as losses due to friction and turbulent flow. 

Figure 2.34 shows graphically how isothermal compression requires less 
work than polytropic compression. Work done for isothermal compression 
pv = c and polytropic compression pv n = c is the area under the curve 
on the pressure— volume diagram, which is less in the case of isothermal. 
The aim is to compress air with minimum work, so closer to isothermal is 
better. 



compression processes 


Heat transfer to the jacket surrounding the compression chamber 

For polytropic and isothermal processes there must be heat transfer to the jacket around the 
compression chamber, which will always have some kind of cooling jacket in order to enhance 
cooling to help approach isothermal conditions. This is necessary because of the first law, which 
requires heat to be rejected if work is done. For a polytropic process we can calculate the heat 
transfer using the steady flow energy equation: 

Q + W = mAh 

Q = mc p (T 2 - 7j) - f^[T 2 - 7j] 


Using equation (2.49) and the definition of c P , since R = c v (y — 1) 

Q = mc v y(T 2 ~ TJ 


cj ; y - 1) 

f z ] ~ [T 2 - TJ 


ntncJy — 1)1 

Q * |mv/ - 1 l ] [T 2 - TJ 

. y - 1 

Q = m J — q,[T 2 - TJ 


( 2 . 50 ) 


Therefore, the work input, and the enthalpy rise associated with it, require that this heat is lost 
to the cooling jacket surrounding the cylinder. 


Efficiency 

There are standard expression methods for describing how good a compressor is at doing its 
job. The isothermal efficiency describes how close the processes are to the unachievable 
ideal isothermal process. The isothermal work formula is given by: 

W=mRT x In— ( 2 . 51 ) 

Pi 

This is the minimum work required for the compression over the pressure ratio with the given 
mass flow rate. Comparing this with the actual ideal work produces the isothermal efficiency. 

The volumetric efficiency describes how well the operation of the machine draws air into 
the cycle and delivers it to the high pressure reservoir. The loss of efficiency results from the 
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small clearance volume described in (6) of the machine cycle above. The air is trapped in the 
cylinder when the piston starts its descent and is expanded until the lower pressure is reached, 
by which time it has taken a significant part of the potential inlet stroke of the piston. 


_ volume induced at inlet si 


swept volume 


Vi - n 

Since nq voi = — and V l = V s + V c , then 


v. + v c - vA — } 


me -i 


(2.52) 


Volumetric efficiency decreases as the pressure ratio 
increases and as index, n, is reduced, as can be seen in 
Figure 2.35; there is less volume drawn in during the 
induction stroke. Dotted lines show the effects of 
decreasing the pressure ratio and of decreasing the 
polytropic index. In the first case, the swept volume is 
constant and the expansion has to take up more of the 
swept volume. In the second case, reducing n causes the 
expansion and compression lines to lean more towards the 
horizontal (i.e. towards pv° — constant pressure process). It 
can be seen that increasing isothermal efficiency reduces 
volumetric efficiency. Whatever the higher pressure is, the 
clearance volume is the same and the gas in the clearance 
volume expands during the same part of the induction 
downstroke of the piston, taking up an increasing part of 
the swept volume, V s , as the high pressure is increased or 
the polytropic index is reduced. 



Figure 2.35 Effect of increasing pressure and index n on the 
volumetric efficiency 


Mass flow rate delivered can be calculated using the volumetric efficiency and the stroke volume 
per cycle: m inducetl = nq vo j . and per second: m = N.r\ vol . V s .p\ where N is compressor speed 

i.e. rpm/60 for single-acting case, and 2 X rpm/60 for the double-acting case. 


Multistaging and intercooling 

Volumetric efficiency is affected by pressure ratio such that an absolute limit occurs, at which 
no delivery occurs, and the entire stroke is taken up in expanding the clearance volume. For 
example, the formula for volumetric efficiency shows that delivery is zero when Phigh/piow 
= 22.6 for n = 1.3 and -y = 10%. In practice, (phigt/fiow) f° r a single stage is limited to 
approximately 4:1, and two or more stages are used for higher delivery pressures. The pressure 
between stages is the intermediate pressure p ( (which is now the inlet pressure for the second 
stage). This is referred to as multi-staging, in which the first stage of the compressor delivers 
to a further stage at the higher pressure, which in turn increases pressure further. 

In the simplest case of two consecutive stages, as shown schematically in Figure 2.36, each stage 
can be treated as a separate subject, considering the conservation of mass between the two 
stages and that pressure from the outlet of the low pressure stage is the inlet pressure for the 
higher pressure stage. It is obviously desirable, to reduce the work done by the compressor. The 
total work input is a minimum when the intermediate pressure is p, = yPhighPlow-This formula 
can be derived by expressing the formula for the combined power in terms of the intermediate 
pressure, p,-, and differentiating with respect to p,- to give the point at which minimum work 
occurs when its value is zero. In this case, the work in the low pressure stage is equal to the 
work in the high pressure stage, W l P = W H P 
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In order to maintain a good isothermal efficiency and thus keep the work done to a 
minimum, there is an opportunity in multi-staging to reduce the temperature of the 
intermediately held fluid. In air compressors, this is achieved by applying a water cooling 
jacket around the intermediate receiver chamber as illustrated in Figure 2.37. This is 
referred to as intercooling. Ideally the temperature in the intercooler is reduced to the 
same temperature as the inlet temperature, and this is called complete intercooling. 
Figure 2.38 shows the benefit of this in a single-acting two-stage compressor on 
the indicator diagram, since the dotted line shows the machine cycle for the second stage 
without intercooling for the same mass flow as the first stage; this area is reduced on the 
p-V diagram in the intercooled case, and a smaller (i.e. physically more compact) second 
stage is required, doing less work. The area between the dashed and solid line is the work 
saved by intercooling. 



Figure 2.37 Schematic of a two-stage compressor with an 
intermediate receiver chamber which can be cooled to apply 
intercooling to the working fluid 


V s - 



Figure 2.36 Schematic p-V 
diagram of a two-stage 
compressor 


A further benefit may be obtained in two-stage compression, by making use of the 
downstroke of the piston as shown in Figure 2. 39. This is a double-acting two- 
stage compressor since it acts in both directions of travel of the piston. Air drawn 
into the low pressure upper part of the machine, is delivered to an intermediate 
pressure chamber, which in turn provides the inlet fluid for the high pressure stage 
in the lower part of the machine. In practice, the sectional area of the lower cylinder 
must be smaller than that of the upper in order to maintain balanced mass flow 
between the two stages. Incidentally, it is also possible to have a double-acting single- 
stage compressor to improve compactness of a particular design. The additional cost is 
in the design and manufacture of a more complex machine. The figure shows that as 
with the case of a single-acting two-stage compressor, intercooling can be applied in 
an intermediate pressure chamber. 


c 


Intercooler coils 


-27 


Clearance 

volume 



Figure 2.38 The effect of 
intercooling is to reduce the 
work in the higher pressure 
stage; this shows that the 
same mass can be compressed 
using a smaller high pressure 
compressor stage 


Figure 2.39 Schematic diagram of double-acting compression 
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Learning summary 

By the end of this section you will have learnt: 

/ reciprocating compressors use a piston driven in a cylinder, with a volume of gas at low pressure 
induced on each cycle, which is compressed and delivered at a higher pressure; 

/ the terminology for compressors describes how the machine is constructed, and the method of 
operation. In particular, the clearance volume is significant for the operation of the compressor since 
it limits how much air can be drawn in during the induction stroke; 

/ since a new volume of gas is drawn in at every induction, a state diagram does not represent a 
true cycle, but rather a machine cycle. The p-V diagram is known as the indicator diagram of the 
compressor; 

/ the compression requires work, and it is possible to calculate the ideal work assuming polytropic 
processes, and no losses to friction in the machine. This work involves heat generation, and heat 
that must be lost in order to comply with the first law of thermodynamics; 

/ the two measures of compressor efficiency are the volumetric efficiency that describes how much 
the volume drawn in is limited by the expansion of the clearance volume, and the isothermal 
efficiency that describes how far from the ideal isothermal compression the machine cycle 
process is; 

/ multistaging is used because of the limit of volumetric efficiency on pressure ratio. An intermediate 
pressure is achieved in one stage, followed by another stage to the higher pressure; 

/ there is an ideal intermediate pressure, based on the least work done, which is found to be when 
each stage does equal work; 

/ intercooling is used in multistage compression in order to more closely approach the isothermal 
condition and improve the compression work done for a given amount of work. 


2.6 Heat transfer 

Heat is energy in the process of transfer under the driving potential of a temperature gradient. 
The subject of heat transfer deals with the manner and rate at which this transfer takes place. 
Heat appears at the boundary and may be generated by means of chemical and nuclear 
reactions within the body of a system. Our attention is limited to steady-state heat transfer — i.e. 
where there is an established temperature gradient that is not changing. Temperature gradient 
in this context refers to dT/dx, a variation in spatial dimension but not in time. Unsteady 
heat transfer is a little beyond our scope. Heat transfer contributes to the irreversibility of 
thermodynamic processes. This is due to: 

• heat transfer through finite temperature differences 

• combustion or chemical reactions with a finite temperature difference between the fluid 
and the surroundings. 

We have already encountered two process types specified by the method of heat management 
within the process: 

• adiabatic — no heat transfer, pv* = c, q = 0 

• isothermal — constant temperature, pv = c 

Heat transfer can therefore be seen to affect the efficiency of processes, and as we shall see, 
this is a direct consequence of the second law. It is also important for cooling of engineering 
components at all scales. 
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Conduction 


In conduction, energy is transferred from molecule to molecule of a substance by passing 
on vibrational kinetic energy. Under steady conditions the molecule will pass on as much 
energy as it receives; under non-steady conditions, however, the mean energy level of the 
molecule will rise. The KE of the molecule is proportional to its absolute temperature. The 
most obvious example is transfer of heat through a solid material. Figure 2.40 shows a brick 
wall. If the surfaces of the brick wall are maintained at T\ and T 2 with the brick initially at T 0 , 
this simplified schematic shows that a transient development of temperature profile up to the 
steady condition will occur. The true profile of temperature development is similar to this, but 
complicated by the true physical behaviour of the medium. Transient heat transfer is beyond 
the scope of this volume, so what is considered here is the end state of the transient: the steady 
state condition. 



Temperature at steady 



Temperature at start 


Figure 2.40 Temperature gradient through a brick wall, 
which develops as the result of a change from T 0 to T t on 
the left-hand side 


Fourier’s Law for conduction 


The law that describes conduction is an experimentally determined law known 
law. For one-dimensional heat flow it may be expressed as: 


Q = - kA ^; or i"=~ k ^w 


where Q is the heat flux, o 


e of heat flow, through area A of surface 


Fourier’s 


q" is the heat flux density, or rate of heat flow per unit area, indicated by the double prime 


k is the thermal conductivity of the material 
^ is the temperature gradient in the x-direction. 


Units of k are [W/mK]. ‘One-dimensional’ means that the heat travels in only one direction, 
and gradients of temperature in other directions are zero. Taking the example shown in Figure 
2.40, the steady state condition has a linear gradient of temperature, and we may consider AT/ 
Ax = (T 2 — T|)/ (x 2 — x 1 ).The area through which the heat flux flows is perpendicular to the 
page, and has area A, or we may usefully consider the heat flux per unit area of wall. 

The conductivity of materials tends to alter with temperature, and certainly varies according 
to the type of material - metallic solids are known to conduct well compared to non-metallic 
solids. The chart (taken from Bejan, 1998) in Figure 2.41 shows the variation of conductivity 
with temperature and for several materials. There is an enormous range, especially at low 
temperatures. At temperatures above room temperature (about 300K) conductivities vary less 
than at temperatures approaching absolute zero, only in the order of hundreds of W/mK. 
Figure 2.41 shows that materials have a wide range of conductivities — for example, notice that 
the conductivity variation with temperature of copper, which at near absolute zero can reach 
10 000 W/mK, but at room temperature is 400W/mK, and compare with quartz that ranges 
from 0.1 to 3W/mK over the same range of temperature. There are therefore engineering 
materials suitable for different heat transfer requirements, and every material has a conductivity 
that can be used to inform us how well a body will conduct heat. 
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Figure 2.41 Heat transfer coefficient for various materials over a range of temperatures 
(Bejan, 1993, Heat Transfer, John Wiley and Sons, Inc.) 


It is useful at this point to consider the second law consequences of heat transfer. All heat 
transfer across finite temperature difference is irreversible. Q transfers to increase the internal energy, 
u [J/kgK] of working fluid; the working fluid proceeds round the thermodynamic cycle and 
rejects the same heat, Q, in cycle at ^COLD to return to its initial state. The second law states 
that it is impossible to get the same heat to transfer back to the hot reservoir again without 
doing work — in other words without adding extra energy to transfer the same heat back again. 
This demonstrates the irreversibility in a thermodynamic cycle. 

Conduction is degradation of heat. Referring to Figure 2.42, which shows conduction 
through a solid from a hot source to a cold sink at steady state, the heat leaves hot at 
Q/Thot and enters the cold at Q/T COLD . Since T HOX > T cou> Q/T HOX < Q/T COLD , 
i.e. -Scold > -Shot- This shows that the entropy contained in the heat energy that is transferred 
has increased. The conductor at steady state has no change of state, its entropy has not 
changed by this heat transfer — it is the source and sink, i.e. the surrounding ‘universe’ that has 
experienced a change in entropy. 



Figure 2.42 A solid 
conductor transferring 
heat from a hot source 
to a cold sink 


Thermal resistance 


A trick is used in order to make calculation of heat transfer simpler. It notes the similarity 
between the physical nature of heat transfer and electrical conduction. Starting with Fourier’s 
Law for heat conduction, integrate in the direction of heat flow (x) to give: 


<2 = 


-AT 

x/kA’ 


i" 


-AT 

x/k 


where AT is the overall temperature difference across a thickness x (and k is constant in the 
direction of heat flow, if the heat flow is unidirectional). 

Compare this equation with I = V/R - (Ohm’s Law) and the analogy between heat flow and 
electric current flow becomes apparent: 

temperature difference AT — voltage difference, AC 
heat flow, Q — current flow, I 

thermal resistance x/kA - electrical resistance, R 

Thermal resistance [K/ If 7 ] is dependent on the material conductivity (fe), the thickness 
through which heat is passing (x) and the cross-sectional area through which it travels {A). 


Thermodynamics 


The example in Figure 2.43 shows how thermal resistances can be 
used to simplify the analysis of a wall having several layers of different 
materials. This effect of thermal conductivity variation in a plane wall 
can be found in many household and engineering situations, e.g. a 
wall of a house or a refrigerator compartment, or an insulated pipe. 

At steady state, if heat, q, passes through the first wall surface, the same 
heat, q, must pass through all interior faces. (Note, in the steady state ; this 
condition does not hold in the transient case.) The relative temperature 
gradients in Figure 2.43 illustrate that materials with low k are more 
resistive to heat transfer — i.e. if they are good insulators, they maintain 
a high gradient of temperature. It is not necessary to know the interior 
face temperatures to know the overall heat transfer - all that is required 
is the material properties and its thickness. 

We know q = \ = under steady conditions, hence applying 

Fourier’s law to each layer gives 

—A T x _ -A T 2 _ -AT 3 
5 “ R th1 “ R-TH2 “ R-TH3 
q (&TH1 + -RtH 2 + -Rfm) = 2(ATj + AT 2 + AT 3 ) 



Figure 2.43 A plane wall with three layers 
of increasing conductivity; note low 
conductivity results in steep temperature 
gradients - useful in situations where 
insulation is required - and vice versa. 


(2.53) 


It is clearly seen that knowing the external temperatures, and the property and thickness of 
the materials in between, that the heat flux per square metre of wall can be easily calculated. 
From this the temperature at the interior boundaries can also be calculated if desired. A thermal 
resistance network can be used to calculate heat transfer in more complex situations connecting 
complex heat flux paths between known temperature points. 


Convection 

Convection includes heat transfer from solid boundaries into fluids between two mixing 
streams of fluids and the redistribution of internal energy within a fluid. Convection implies 
the movement of fluid to transport energy, and so the study of the fluid motion characteristics 
of the situation is of particular importance. Fluid motion may be induced by buoyancy effects 
resulting from changes in temperature within the fluid during the initial stages of heat transfer, 
which are then perpetuated as natural or free convection. Alternatively, the fluid motion may 
be ‘forced’ by an external source — pump or compressor — and forced convection of heat will 
develop. Fluid motion conveys (or convects, carries with it) heat that can be exchanged with 
solids/fluids/gases at other temperatures. 

Newton’s law of cooling 

The rate of heat transfer is defined using a convective heat transfer coefficient on the 
temperature gradient between the bulk part of the fluid and the surface of the wall by using 

Newton’s law of cooling: 

Q=-hA(T f -T w ) (2.54) 

where h is the heat transfer coefficient [W/m 2 K] 

A is the area of the surface in contact with the fluid [m 2 ] 

T w is the wall temperature [K or °C] 

and Tj is the bulk fluid temperature [K or °C], i.e. the temperature in the fluid far from the surface. 

Equation (2.54) defines the heat transfer coefficient, k.The units of h are [W/m 2 K] i.e. heat 
transferred per second, per m 2 of contacted surface area, per degree Kelvin temperature 
difference between bulk fluid and surface. 
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h is difficult to determine analytically for a particular situation since it depends on relatively 
complex fluid flow behaviour that does not lend itself to analysis. The values are usually 
determined by experiment and the use of this empirical data is described in the next section. 
The chart in Figure 2.44 shows approximate ranges of values given the fluid type. 


Boiling, water | 
Boiling, organic liquids | 
Condensation, water vapour | 
Condensation, organic vapours | 

Liquid metals, forced convection 


| Water, forced convection 
] Organic liquids, forced 


| Gases -200atm, forced convection 
| Gases— latm, forced convection 


Gases— natural convection 


1 10 10 2 10 3 10 4 10 5 10 6 
h (JT/m 2 K) 

Figure 2.44 Convective heat transfer - the ranges that occur in 
various common convective situations (adapted from Bejan, 
1993, Heat Transfer, John Wiley and Sons, Inc.) 


Thermal resistance of combined convection/conduction can he calculated in a similar way to 
the conduction case, making use of the similarity with Ohm’s law. The thermal resistance of 
a fluid/solid boundary is defined from Newton’s law. Referring to Figure 2.45, which shows 
a solid plane wall with a high temperature gas on one side and a low temperature gas on the 
other: 


4 a ~ h A A A (T WA T^^) 


(^wa T'ooa) _ (?wa 'Tqoa) 
Vh A A A " R th , a 


kA.rj, m , _ _(7 b Za) _ _(7b 7a) 

Ax [ n 1a) A x/kA ^TH,A-B 


(2.55) 

(2.56) 


^B^bC^ooB Twb) ■ 


(?ooB ?Wb) 

1/Mb 


(?ooB Twb) 


(2.57) 


And the result of this is a combined convective— conductive steady-state 
situation as shown in Figure 2.45. 

Equations (2.55), (2.56) and (2.57) can be combined to give 

. —(Tqqb ~ T'ooa) 

q ~ l/h A A A + A x/kA + 1/Mb 

As before, q is the same through both boundaries (i.e. in the three 
media that it passes through from A to B). There is no need to know 
the intermediate temperatures, only AT need be known in order to 
calculate the overall q. Note: a fluid layer could he contained within a 
structure, such as the air gap in a cavity wall, making multiple thermal 
resistances from the external to interior regions. The thermal resistance 
method is especially useful in such cases. 



Figure 2.45 The effect of convective heat 
transfer at surfaces separating solid and 
gas regions 
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Overall heat transfer coefficient 

Conversely to thermal resistance, the overall heat transfer coefficient, U is useful to give a 
measure of how well heat is transferred in a combined mode of heat transfer. The overall heat 
transfer coefficient is defined similarly to the convective heat transfer coefficient: 

q = ^(ATJ = UA( -T^b) 

UA = (2.59) 

The overall heat transfer coefficient is very useful in complex heat transfer situations such as 
found in heat exchangers, where a large number of tubes and fins may be used, and it is more 
convenient to select a suitable representative area and find the heat transfer for that surface, 
taking into account all the thermal resistances that are involved. 


Axisymmetric problems 

The case of a cylinder with a hot inside surface and cool outside surface is shown 
in Figure 2.46. The situation is commonly found in domestic central heating, and in 
thermodynamic cycles like the vapour power cycle and the refrigeration cycle. The 
temperature profile is not linear as we look radially from the centre to the outside, and 
this is because the area across which heat passes increases as the radius increases. We 
use Fourier’s law of conduction, which still applies, but we must take into account the 
varying area: 

q = —kA^r- (2.60) 

q" = ~k^ (2.61) 

q' = -k.2m.^ (2.62) 

The prime and double prime marks are used to indicate per unit length and per unit 
surface area, respectively. In this case, the temperature profile is no longer linear even 
if k is constant. The measure per unit length is particularly useful in the case of pipes 
containing fluids exchanging heat. 




Figure 2.46 Axisymmetric case 
of conduction - temperature 
distribution across the wall of 
a cylinder 


Recasting the formula as 


and integrating between A and B gives 

k' , r B 

w-rin- - -($ - 


7a) 


SO 

., = (Tb - Ta) 

q Infa/'A) 

2irk 


(2.63) 


For inner convection, then conduction, then outer convection, R n { is 

1/(R X H, convection inner + ^TH conduction of wall + ^TH convection outer) • 

From equation (2.63), R TH conduction of the annular wall is In (r B / r A )/2'7tfe. If there is convection 
heat transfer at the inner and outer surfaces, Newton’s law of cooling still applies, but note that 
the inner surface is now smaller than the outer surface area, so: 


-A T overall 

1 ln(r B /r A ) 1 

h A 2 r nr A 2'nk /i B 2TTr B 


(2.64) 


where AT overa] ] is the temperature difference between fluid outside and inside the pipe. 
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Convection mechanism 


If a fluid moves near to a wall, at the wall, the velocity of the fluid is zero, 
into the fluid takes place by conduction. Thus the local heat flux per unit 


and the heat transfer 
area q" is 


(2.65) 


The hot fluid close to the inner layer is then carried away by convection. From Newton’s law 
of cooling 

q"= -M(T WALL - TJ (2.66) 


Combining equations 2.65 and 2.66 at steady conditions gives 
/t _ -fe(&?y8y)waii 


(2.67) 


This tells us how h can be determined experimentally. A heat flux probe having a known 
thickness is placed on the wall to determine conduction into the fluid. What we would prefer 
is a calculation. Something that relates experimental results with data about the flow - a 
correlation - is required. To understand how the fluid motion affects the heat transfer, it is 
necessary to have a basic understanding of the fluid mechanics involved, and for this it may be 
useful to refer to the fluid mechanics section 1 .3 that addresses boundary layers next to walls. 
The following sections describe the influence of the boundary layer on heat transfer. 


Velocity boundary layer 

In the case of a solid surface with fluid flowing over it, a boundary 
layer forms as indicated schematically in Figure 2.47. This is a layer 
in which the velocity varies, such that on the surface, the fluid 
is stationary because of the solid/fluid interface, and increases to 
the free stream velocity as distance increases towards the free- 
flowing fluid distant from the surface. The laminar sublayer is very 
thin, < 1 mm, and is the region in which laminar motion of the 
fluid occurs, which is dominated by the viscous friction between 
fluid molecules sliding over each other. In fluid that is moving 
sufficiently slowly, the entire boundary layer is occupied 
by laminar motion, and a linear increase of velocity is observed. In faster-moving fluid, the 
laminar flow cannot continue when the velocity gradient causes entrainment of the laminar 
layers into the faster-moving fluid. This ‘tripping up’ of the flow causes chaotic random motion 
known as turbulence, and the nature of the boundary layer then becomes turbulent, with a 
thin laminar sub-layer. Viscosity causes momentum diffusivity, by enabling the spreading of 
momentum by the dragging forces applied by the resulting friction. The velocity boundary 
layer grows with distance in the flow direction to the order of several mm for the case of small 
engineering components, although this may be much larger, for instance in wind boundary 
layers over buildings. The development depends on the viscosity of the fluid and the speed of 
the flow — a relationship that is expressed well by the non-dimensional Reynolds number. 

Temperature profile 

Velocity boundary layer 
Thermal boundary layer 


Beginning of 
heated section 



Velocity boundary layer 

O i Turbulent 

/'"N s ) eddies 
' <120 


Figure 2.47 Development of the velocity boundary 
layer from the front edge of a stationary plate in the 
flowing liquid 
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Figure 2.48 Development of the thermal boundary layer on a flat 
plate with a heated section 


Thermodynamics 


Thermal boundary layer 

Similar to the velocity boundary layer, the thermal boundary layer, as illustrated schematically in 
Figure 2.48, describes the layer over the surface in which the temperature is less than 99% of the 
main fluid temperature difference from that of the wall surface. Thermal boundary layer thickness 
is usually less than velocity boundary layer thickness. It depends on thermal diffusivity, which 
is dependent on the conductivity of the fluid, and on its thermal capacity, expressed as pc P 

Determination of heat transfer coefficient from boundary layer characteristics 

Values of heat transfer coefficient can be measured for any particular geometry, flow condition 
and set of fluid physical properties. To extend the applicability of the data, dimensional analysis 
is used, which allows comparison of similar situations by scaling the size of the particular 
situations such that the thermal transport character is compared according to the appropriate 
fluid properties. Convective heat transfer can be classified by type of flow over type of 
geometry. Dimensionless numbers are used to relate the scale of the flow. 

There are four characteristic dimensionless numbers that are used to express the 
characteristics of convective heat transfer. 


Nusselt Number 

The Nusselt number, Nu, characterises the heat transfer itself and is calculated from 
relationships with other dimensionless numbers as detailed in the subsequent sections. The heat 
transfer coefficient, h, is derived from it. 



Here the Nu v is a local Nusselt number at a particular distance, x, in the geometry being 
considered. From this, if we know the Nusselt number and the conductivity of the fluid, 
we have the convective heat transfer at any particular point in the geometry. Since velocity 
and thermal boundary layers rely on properties of the fluid and flow, there must be a way to 
relate them to the heat transfer rate and hence Nu. Fortunately, there is analysis to determine 
the Nusselt number for many different geometrical situations; these have been compiled 
by researchers, and a very small sample of them will be considered in order to see how 
they are used. Nusselt numbers are calculated using dimensionless numbers that express the 
characteristics of the heat transfer situation, and these are the Prandtl number, which occurs 
in all Nusselt number correlations, the Reynolds number, which occurs in forced convection 
situations, and the Grashof number, which occurs in natural convection situations. These are 
described in the following. 

Prandtl number, Pr - ratio of viscous diffusion to thermal diffusion. 


It was mentioned earlier that the thickness of the velocity boundary layer depends on the 
viscosity, and that viscosity causes diffusion of momentum. It was also mentioned that the 
thermal boundary layer is determined by the thermal diffusivity. The Prandtl number provides 
a comparison of these effects, and compares them in order to see how well a fluid diffuses 
viscously and thermally. It is expressed as: 


(2.69) 


in which v (pronounced ‘nu’) is the kinematic viscosity, which is the more familiar dynamic 
viscosity, |x, divided by density p, and a is the thermal diffusivity, which is conductivity, k, 
divided by density X specific heat capacity, c p . 

Reynolds number. Re — ratio of momentum to viscous forces. 
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The Reynolds number is used in forced convection situations, where the fluid is driven 
mechanically through the geometry. It is familiar from fluid mechanics and is given by: 
_ pUD 


(2.70) 


in which density and dynamic viscosity have their usual symbols, U is a representative 
velocity and D is a characteristic length, defining the scale of the fluid flow, which must 
be carefully selected for each flow situation, sometimes being obvious (e.g. in a pipe it is the 
diameter), and sometimes not obvious, for instance when a geometry has more than one scale, 
such as in a duct with a cross-sectional area change. 


Grashof number, Gr — ratio of buoyancy to viscous forces. 


The Grashof number is used for natural convection situations, those that are driven by 
thermally induced buoyancy. The idea of natural convection is familiar from the expression that 
‘heat rises’. Convective motion results from the alteration of gas density due to temperature 
gradients, and the hotter gas, which is less dense, rises in the denser cooler gas. 

The Grashof number is expressed as: 

gfiPpPAT 

Gr = ^ (2.71) 

g is gravitational acceleration in which |j = 1/T, which is the compressibility for perfect 
gases, l is another length scale, density and dynamic viscosity have their usual symbols, and 
the temperature difference between the free stream far from the surface, and the surface 
temperature is AT. Fluid properties are taken at the ‘film temperature’, which is the mean value 
of surface and free stream temperatures. 


With these three dimensionless numbers, the Nusselt number correlations can be constructed 
from observing experimental situations. The general form of the correlation is Nu = f(Pr,Gr) 
for natural convection and Nu = f(Pr,Re) for forced convection. 


Example of scale of the various quantities involved: 

Prandtl number for water is derived from a = thermal diffusivity, which for water depends on 
c p = 4.2kJ/kgK, p = 1000 kg/m 3 , v = 8 X 10 _7 m 2 /s and k = 0.6W/mk; therefore 
a = 1.42 X 10 _7 m 2 /s and therefore Pr = via. = 5.63. 


Grashof number for air in the case where AT = 20°C and the mean temperature between 
surface and free stream is 290K, [1 = 1/290, p = 1 kg/m 3 , for a vertical surface at 1 m height, 
1= 1 m, |A = 1.8 X 10 _5 kg/m.s, therefore Gr = 4.17 X 10 10 . 

N.B. Gr has to do with buoyancy, Re has to do with forced flows, Pr has to do with all flows. 
Calculation of Reynolds number should be familiar from fluid mechanics. 

(Data taken from tables in Rogers and Mayhew, 1995.) 


Nusselt number correlations 

A great deal of work has been done to correlate physical heat transfer with Nusselt number. 

For heat exchange on the inner surface of a tube: 

Nuj = 3.66 for laminar forced flow, with a constant surface temperature. 

Nu rf = 0.023 Re/- 8 Pr 0 - 4 for turbulent forced flow. 

For heat exchange on a flat plate: 

NUj = 0.332 Re/- 5 Pr 0 - 33 for laminar forced flow with constant surface temperature. 

For natural convection on a vertical plate, there is sensitivity to the particular range of Grashof 
and Prandtl number characteristics: 

Nu v = 0.59 (Gr L Pr) 0 - 25 where 10 3 < GrPr < 10 9 . 

Nu, = 0.13 (Gr L Pr) 0 - 33 where 10 9 < GrPr < 10 12 . 

Once Nusselt number is known, the heat transfer coefficient can be found directly from the 
Nusselt number relationship Nu = hL/k. More specifically h = kNu d Id for a pipe of diameter 
d and h = INu, lx for a vertical plate of length x. 
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Radiation 

When heat transfers by thermal radiation or by radiant heat transfer, energy is transferred 
via electromagnetic wave motion, which requires no intermediate material and this happens 
at the speed of light, 299 792 458 m/s. Radiation heat transfer will occur between any two 
separated bodies whenever there is a temperature difference between them and they have direct 
line of sight of each other. 



Ice at — 5°C or 268K Sand at 60°C or 333K 

Figure 2.49 Radiant heat transfer occurs whenever there is a 
temperature difference between two bodies that can see each other 


The bodies in Figure 2.49 radiate heat towards each other. It is not one way since both bodies 
are at temperatures above OK; however, the net heat transfer is from the hotter to the cooler. 
What they receive from each other depends on what they can directly see of each other. It is 
important to bear in mind that each body is radiating all the time regardless of what it can see, 
purely dependent on the temperature of its surface, and that net transfer occurs because one 
body emits more than the other. 


Stefan-Boltzmann law for radiation 

The basic radiation law (due to the work of Stefan and Boltzmann) is: 
q h = <JAT 4 

That is the rate of heat emission from a ‘black body’ as indicated by the subscript, b, with 
a surface area A [m 2 ] at temperature T (must be in kelvin) and a is the Stefan-Boltzmann 
radiation constant: 


O = 5.67X1CT 8 Wrn^K -4 . 


A black body is a perfectly radiating body. It emits the maximum possible energy at all 
wavelengths at a given temperature. For real materials, radiation is less than the black body, and 
depends on emissivity, s, which is in the range 0 to 1. Equation 2.72 becomes: 

q b = cteAAT 4 

Table 2.4 shows a range of emissivities for different materials. 


Aluminium foil 

0.01 

Copper, polished 

0.04 

Steel, cold rolled 

0.08 

Steel, rough plate 

0.95 

Zinc, oxidized 

0.11 

Fireclay brick 

0.75 

Asphalt pavement 

0.9 

Cotton cloth 

0.77 

Fyrex glass 

0.9 

Bituminous felt 

0.9 

Frost (snow) 

0.98 


Table 2.4 Typical values of emissivity 
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From Table 2.4 it can be seen that there is a significant range of emissivity for different 
materials, that aluminium foil only emits 1 per cent of the potential black body radiation at a 
particular temperature, whereas frost/ snow is very close to being a black body. 

The Stefan— Boltzmann Law shows us that if a body has a temperature above OK, it will emit 
heat to the surroundings. However, it also receives heat from the surroundings by the same 
mechanism if there is anything in the surroundings with a temperature greater than OK. We 
have to consider what it will receive from other bodies. 

Black body heat transfer 

The law of radiation heat transfer (for black bodies) is: 

j b| i =<tA 1 F 1 _ 2 (T 2 * - V) (2.74) 

Subscript b indicates that this is for a black body surface. The heat transfer depends on the 
surface area of the body that is being considered, A u and the view that it gets of the body it is 
exchanging heat with, the second body having area H 2 .The amount of view is quantified by 

the ‘black body view factor’, F U2 . 


Figure 2.50 The simplest case of radiant 
heat transfer between two bodies - a 
sphere contained within a hollow sphere 
at different temperatures 



For the simplest two body system, referring to Figure 2.50: 

Fu2 = 1 
F 2 _ ! = a x /a 2 
Fi4 = 0 

F 2 _ 2 = 1 - {A x /A 2 ) 

Subscript 1-2 on the view factor means the amount of object 2 that 
object 1 can see in order to receive radiation from it. This is the simplest 
case - a sphere within a hollow sphere. The calculation of other view 
factors is beyond the scope of this unit. 


Worked example 

A man stands in a freezer room of a local fast food restaurant, with clothes 
having a surface temperature of 5°C and emissivity of 0.3. What is the rate 
of heat loss from the surface of his clothes if the temperature of the freezer 
walls is -20°C? Assume that the situation can be modelled as a small sphere of 
surface area 1 m 2 representing the man, inside a larger sphere, and that black 
body surfaces are assumed. 


Use q bA = <jA x F x _ 2 (T 2 4 - T-, 4 ) with F x _ 2 = 1 and A x = 1 m 2 . 

jb, i_2 = 5.67 X 10“ 8 X 1 X 1 X (253 4 - 278 4 ) = —106 W 

It is interesting to note that 100 W is a reasonably maintainable heat generation rate from 
a healthy standing person, and that the thickness of cloth, y, of conductivity 0.08W/mK, 
say, required to maintain the surface temperature of 5°C if the skin temperature beneath 
is maintained at 22°C is from the Fourier Law: 


and 


106 = 0.08 X 1 X (22 - 5)/y 


y = 0.013 m or 13 mm 

the thickness of a thick fleece. 
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Solid/fluid boundary with significant radiation 




Figure 2.51 A solid body with a constant temperature on the left-hand side and 
a fluid on the left-hand side resulting in combined convective and radiative heat 
transfer on the right-hand side 


Consider a 1-D heat flow in a solid as shown in Figure 2. 51. This case is difficult to solve, 
because it is non-linear with respect to temperature, T, and so the sum of thermal resistance, 

R i h, approach cannot be used. The new procedure requires the solution of energy conservation 
at the surface. Since the heat transfer to the surface by conduction must all leave the surface 
by convection and radiation in order for the steady state condition (with no change in 
temperature of the body): 


icond = icon* + tfrad 

and using Fourier’s law, Newtons law and the Stefan— Boltzmann formula: 

k(T A - T s ) , 

Ax = h(T s - TJ + crF s _ 0C (T s 4 - Tj) (2.75) 

Temperatures must all be in Kelvin in this formula. In order to calculate T s , the surface 
temperature, we have to solve by a numerical iterative approach, or plot on a graph as shown 
on the right in Figure 2.51. 

Alternatively, in the case where the temperatures in Kelvin are reasonably close to each other 
(say within 10%) a linear relation may be contrived as follows by factorizing the Stefan- 
Boltzmann contribution: 

hi = °'A s F s _ 0 c(T/ - TJ) 


hi = (tA S F S-oc(T s 2 - - T oc 2 ) 


hi = <tA s F s _ 2 (T s + T ^ (T s 2 + TJ)(T S - T x ) 

R ™ = aAs F s-oo( T s + Too)(T s 2 + Tj) (2-76) 

Take the average of T s and T x and substitute into equation (2.76) for both as the average, T AVG , 
then 

Rth = 1/^-oo 04T avg 3 (2.77) 

This method may be useful in complex thermal resistance networks having a radiative 
contribution. 


107 


An Introduction to Mechanical Engineering: Part 2 


Learning summary 

By the end of this section you will have learnt: 

/ there are three modes of heat transfer - conduction, convection, and radiation; 

/ conduction heat transfer is determined by Fourier’s law of conduction. The conductivity used in this 
law depends on material properties and temperature and is defined for most materials; 

/ the linear relationship between heat energy transferred and the temperature difference through 
which it moves is analogous to Ohm’s law of electrical resistance. In a similar manner, the heat 
transferred is analogous to current transferred, the temperature difference to the potential voltage 
difference, and the remaining terms are analogous to the electrical resistance, and are termed the 
thermal resistance; 

/ convective heat transfer results from fluid moving from place to place and conveying and 
mixing material of differing temperatures. Forced convection describes situations in which the 
fluid is mechanically driven, by wind or machine, and natural convection describes transfer 
due to temperature gradients in the fluid causing buoyancy and hence naturally driven 
circulation; 

/ Newton’s law of cooling assumes a known convective heat transfer coefficient that depends on 
the flow configuration. Once known, this heat transfer contribution can be treated as a thermal 
resistance similar to the conductive case, and combined overall thermal resistance can be used to 
calculate overall heat transfer; 

/ conversely, the inverse of the overall thermal resistance divided by the overall surface area available 
for heat transfer is known as the overall heat transfer coefficient; 

/ in the case of heat flow radially, the surface area of conduction increases with radius and leads to 
a logarithmic expression for the thermal resistance due to conduction; 

/ convection depends on fluid flow, and hence on laminar and turbulent flow characteristics; 

/ similar to the velocity boundary layer, there is a thermal boundary layer in situations where heat 
transfer is taking place. It depends on the thermal diffusivity (the thermal equivalent of viscosity), 
and on thermal capacity, m.c p ; 

/ the Nusselt number is a dimensionless parameter that represents convective heat transfer 
coefficient and, if known, provides the convective heat transfer coefficient directly. Correlations 
of experimental data with Nusselt number are made with other dimensionless numbers: 
the Prandtl number is used for all Nusselt number correlations and is the ratio of kinematic 
viscosity, v, to thermal diffusivity, a; Reynolds number is used for Nusselt number correlations 
in forced convection situations; Grashof number is a dimensionless ratio that represents 
buoyant and viscous forces, and it is used for Nusselt number correlations in natural convection 
situations; 

/ Nusselt number correlations are available in heat transfer texts from experimental evidence; 

/ radiant heat transfer is by electromagnetic radiation due to the release of energy from molecules 
excited by heat energy. It is determined according to the Stefan-Boltzmann Law of radiant heat 
release. For radiative heat transfer, two bodies at different temperatures having direct sight of each 
other release and receive heat dependent on their temperatures; 

/ the emissivity of a surface limits the amount of radiant energy released and modifies the Stefan- 
Boltzmann Law for surfaces that are not thermally black. A truly thermally black surface is not 
necessarily coloured black, but has the property that it releases all its radiant energy due to its 
temperature; 

/ it is possible to calculate combined mode heat transfer with conduction, convection and radiation, 
but in cases where radiant heat transfer is present, the calculation is not directly solvable due to 
the fourth power of T in the Stefan-Boltzmann Law. 
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2.7 Heat exchangers 

Heat exchangers allow exchange of heat between two fluid streams without mixing them — 
perhaps the most familiar is the car radiator, which cools engine cooling water with a forced 
draught of cool air. There are two main types of heat exchanger — the recuperator and the 
regenerator as shown schematically in Figure 2.52 

The recuperator (Figure 2.52(a)) is the most familiar and 
common type of heat exchanger; it is used in condensers, 
evaporators and boilers, and for cooling liquids and gases. A 
hot stream is contained in tubes that pass through a vessel 
containing the cooler fluid, or vice versa in terms of hot 
and cold. The fluids do not make physical contact with 
each other. They exchange heat through the walls of the 
heat exchanger across a temperature gradient that varies as 
progress is made through the recuperator. 

The regenerator (Figure 2.52(b)) is commonly used for 
preheating atmospheric air for combustion in the furnace of a power station using the exhaust 
gases at approximately 200°C as they travel to the chimney stack. There is a dividing wall 
centrally down the tube in which the regenerator is mounted, with the hot gases on one side 
of the wall and the cold gases on the other. The rotating mesh, or frame of a material with a 
significant heat capacity, has elements that absorb heat on the hot side, and release heat to the 
cold side, alternately taking heat from one side to the other as the mesh rotates. In this way, in 
the power station, the exhaust gases leave at a lower temperature, and the combustion air has 
been warmed, which means that more heat has been used to generate power. Any heat loss in 
a power station is a corresponding loss in power generation, so any recovery of potentially lost 
heat is useful. 

This section demonstrates how to calculate the heat exchange in recuperators and from this 
point they will be referred to simply as heat exchangers. The simplest type of heat exchanger 
is the shell-and-tube configuration, which implies an inner tube containing one of the two 
fluids, contained within an outer shell (which is often just a bigger tube) as shown in Figure 
2. 52(b). These are the simplest type to analyse, because we can estimate reasonably well the 
temperature profile, from our knowledge of heat transfer in convective— conductive modes 
from the previous section, and hence the local and overall heat transfer coefficient. Figure 2.53 
shows a simplified geometry of the shell-and-tube; there are obviously two directions that 
each fluid can pass in, and this gives rise to a very important distinguishing characteristic - 
whether the flow is parallel flow, in which the fluids travel in the same direction along the 
geometry, or counter flow, in which the fluids travel in opposite directions. From the figure, 
it can be seen that the corresponding temperature profiles are significantly affected by the flow 
directions. 




Figure 2.53 Schematic of the 
geometry and temperature profile 
in a shell-and-tube heat exchanger 
with counter (a) and parallel (b) flow 
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Energy balance 

In Figure 2.54, a single-pass shell-and-tube heat exchanger is 
seen, which is the simplest possible configuration: an inner fluid is 
contained within the tube and an outer fluid is contained within 
the shell. By the principle of conservation of energy, heat transfer 
from the hot fluid = heat transfer to cold fluid: 

Q = {mc p (T in - T out )} hot = {thc p (T ouc - T ln )} cold (2.78) 

where thc p is the thermal capacity rate of the fluid, that is, its 
capability of storing heat per degree of temperature rise per second. 



Figure 2.54 An elemental length of a shell-and- 
tube heat exchanger for the consideration of the 
heat transfer in it 


Heat transfer calculation 


With the developed formulae from heat transfer theory we can, in the steady state, set up 
an Ohm’s law analogous formula for the heat transfer in terms of the overall temperature 
difference between the hot and cold fluids at a particular position on the heat exchanger, and 
the thermal resistance at that point, which is dependent on the convective heat transfer from 
the internal fluid to the wall of the tube, the conduction of heat from the inside of the wall to 
the outside of the wall, and the convection from the outside of the wall to the external fluid. 
Hence, 


AToverall 

1 In r/r,- \ 

h A A A + 2vkL + h^A B 


(2.79) 


Subscript A represents the inner (tube bound) fluid and subscript B represents the outer (shell 
bound) fluid. Since the wall of the tube is usually thin compared to the pipe diameter, the pipe 
inner surface area is very nearly the same as the pipe outer surface area, so we can say that A a 
= A b = A, and the thermal resistance can be represented instead as an overall heat transfer 
coefficient for that area: 

q = LH(AT overall ) (2.80) 

= LH(T ooA - T^b) 

in which 


and where the °° symbol indicates bulk temperature of the inner or outer fluid, i.e. the 
temperature of each fluid far enough from the surfaces to be independent of temperature 
gradients. Considering the element shown in Figure 2.54, there is the contribution of heat 
transfer for the elemental length of the heat exchanger: 

dq = ~dA.U(T B - Ta) 

And the total heat transfer is 

4 = Zd'q (2.81) 

summing over all elements on length. This energy balance gives the overall performance of the 
heat exchanger, the transfer of heat between the two fluids. The calculation depends on the 
performance of the heat exchanger, which is due to the heat transfer. Theory tells us how to 
calculate heat transfer across the surfaces, but the situation is complicated by the stream-wise 
gradients of temperature. 

Note: Even if U is constant, (T B — T A ) varies along the heat exchanger. This must be taken into 
account. There are two approaches available for analysis of heat exchangers - logarithmic 
mean temperature difference (LMTD) and effectiveness-number of transfer units 
(e-NTU) - which will be elaborated on in the following sections. 
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LMTD method for sizing heat exchangers 

In a parallel- and counter-flow single-pass tubular heat exchanger, as shown in Figure 2.54, the 
temperature difference between the two fluid streams varies as the flow proceeds from inlet to 
outlet. The LMTD (log mean temperature difference) analysis gives the equivalent constant 
temperature difference for calculation of heat transfer, such that it is the same as the actual heat 
transfer with the temperature varying with progression through the heat exchanger. LMTD 
finds a representative temperature dependent on the overall temperature differences at inlets 
and outlets, which requires the heat exchange between the two separated fluids. It is then 
used to find the area of a heat exchanger that is required to achieve the desired heat transfer. 
Alternatively, if a heat flux is required and temperatures are not known, these can be found by 
iteration. 


Referring to Figure 2.55, the LMTD formula is expressed in terms of the temperature 
difference between the fluid in the tube and in the shell at position a and that at position b: 


LMTD = A T m 


A Ta - ATb 
ln(ATa/ATb) 


(2.82) 


where ‘a’ refers to inlet and ‘b’ to outlet conditions or to any two far apart points along 
the heat exchange surface: 

A T a = (T b - T a ) 3 (2.83) 

A T 6 = (T b - T^) b (2.84) 

These are the overall temperature differences between the two streams. Equations (2.83) and 
(2.84) apply to, and are the same for, parallel and counter-flow arrangements. The LMTD 
formula is derived from consideration of elemental heat exchange and can be easily verified. 


For sizing heat exchangers (i.e. working out required size for given flow rate and 
temperature change), the LMTD approach is used to give the area required from: 


A 


Q 

UA T m 


Rating of heat exchangers (i.e. working out the temperature change in a given heat exchanger) 
can be achieved by iteration using the LMTD approach when there is a known overall heat transfer 
coefficient, U, which must be determined by calculation or from tabulated data. 


Worked example 

Use of LMTD to rate and size a heat exchanger. 



Calculate the LMTD for parallel- and counter-flow cases that have the same 
inlet and outlet stream temperatures. The cold enters at 20°C, and exits at 40°C. 



Figure 2.55 Single-pass 
shell and tube heat 
exchanger 
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' Referring to Figure 2.56, the temperature profile is distinctly different in the counter- 
flow and parallel-flow operation. In the parallel case, 

(80 - 10) 

LMTD - []n(8()/10) ] 

LMTD = 33.7°C 

and in the counter flow case, 

(60 - 30) 

LMTD = TW60730)T 
LMTD = 43.3°C 

Note: The parallel arrangement has the lower LMTD — therefore needs a larger surface 
|9 area for the same heat transfer, i.e. a larger heat exchanger. This shows the advantage of 
■ the counter-flow arrangement. 


Calculation of heat transfer rate 

Q = U. A. LMTD 

The overall heat transfer coefficient (HTC) is determined from 

UA* ^ 

1 1 

2m () h 0 L + 2TtfeL 2 tt rfijL 

The ‘i’ and ‘o’ refer to the inside and outside surfaces of the tube. If the heat transfer 
coefficient variations along the tube are large, then an average U value is used. 

This is usually unnecessary. To a first approximation, evaluate U at 

T _ q+ t „) 

The variation is because the heat transfer coefficient may vary with fluid temperatures since the 
fluid properties may vary with temperature. In a shell-and-tube heat exchanger, with the tube 
carrying hot fluid and the shell cold fluid, the conduction term can usually be neglected if the 
thickness of the wall is small and the conductivity high. 

The LMTD analysis requires the outlet temperatures of the fluid streams to be known. 

This may necessitate an iterative solution involving guessed outlet values. This is the main 
disadvantage of the approach. 

Procedure: 

(1) guess outlet temperatures; 

(2) calculate LMTD; 

(3) calculate heat transfer rate from step 2; 

(4) calculate outlet temperature for step 1 ; 

(5) repeat until converged. 

Comparison of the advantages of parallel and counter flow 

The counter-flow type is the most compact for a given heat transfer rate due to the 
temperature profile. 

The parallel-flow type gives a lower T nlax for the tube wall temperature. 
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In more complex designs of similar type, the 
‘tube’ flow passes through the heat exchanger 
more than once, and the ‘shell’ side flow can be 
directed across the tube to increase turbulent 
heat transfer. Generally, heat exchangers of 
this type are called shell-and-tube heat 
exchangers. 

Figure 2.57 shows a typical shell-and-tube 
heat exchanger. As shown in the schematic, the 
tube is the section between the two end caps, 
and the end caps themselves contain fluid that 
passes through the tubes. There are two passes 
of the tube fluid through the shell fluid in this 
case, and the central mark on the opened end 
of the cap shows how the two halves of the 
cap are separated to allow for this. 
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Cross-flow and multipass 

The stream temperature differences are more complicated in 
cross-flow or multipass configurations. The effects of cross-flow 
and multipass are accounted for by introducing a correction 
factor F such that: 

Q = UAFAT m (2.85) 

The factor F is derived from calculations similar to the derivation 
of LMTD or read from graphs. Figure 2.58 shows a matrix heat 
exchanger with cross-flowing fluids; the resulting temperature 
profiles are complex. The LMTD method relies on prior 
experimental data. 

A simpler and more direct approach for finding the temperatures 
is the e-NTU method. 

Correction factor - LMTD 

Kays and London (1984) catalogues the many experiments about 
heat exchangers conducted by the US Navy in the 1950s, forming 
the basis for modern heat exchanger design and theory. They present 
methods for calculating the behaviour of many different heat 
exchanger types and configurations. The methods described in this 
section come from that text, where you can read the full description. 

The correction factor allows us to use the LMTD type of calculation 
for more complex geometries. We must introduce two new variables: 


; stream and t\ and t 2 are the 



Figure 2.57 A typical shell-and-tube heat 
exchanger used in the hot water system at the 
University of Nottingham 


Tt - T 2 


Tj and T 2 are the temperatures of o: 
other stream. 

Using this method from Kays and London, we can read the correction 
factor from charts in the book, knowing P on the horizontal axis, 
a selection of curves for various values of R, and corresponding 
correction factor, F from the vertical axis. Three cases are illustrated in 
Figure 2.59 from Bowman, Mueller and Nagle (1940) in Bejan (1993). 



resulting temperature profiles at outlet of the 
two streams 
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Figure 2.59 Cross-flow, both streams unmixed (i.e. the fluid passing in each direction is contained in a matrix 
of passages) (R.A. Bowman, A.C. Mueller and W.M. Nagle, 1930, ‘Mean temperature difference in design’, 
Transactions of the ASME, vol. 12, 417-422) 
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If factor P tends to zero, the stream having temperatures t\ and t 2 has a change of phase, 
i.e. t\ -» t 2 . 

If R tends to zero, the stream having temperatures 7) and T 2 has a change of phase. 

If a stream has a phase change, the capacity rate is effectively infinite, because the fluid will 
absorb heat without changing temperature. These are the two limits on the charts. 

Figure 2.60 shows what is meant by the terms used with cross-flow heat exchanger: the 
corrugations keep fluid moving as an unmixed stream as it passes through the channel, 
the open passage has no corrugations and fluid is free to move across the full face of the 
channel and is called a mixed stream. 

Care using LMTD 

It is important to understand the capacity rate of the fluids in the heat exchanger, i.e. 





Qold = »>coidScoid and C hot = m hot c Pihot 

These determine the magnitude of temperature changes for a certain heat transfer. It is 
important to always use the formula for LMTD: 

q = UA A T m (2.86) 

together with the heat capacity formulae: 



q = { me p (T in - T out ) } hot = {mc p (T out - T ln )} cold 

Using equation (2.86) on its own suggests that the heat transfer does not depend on the 
capacity rate of the fluids. The limit of the heat exchanger capacity is when condensation 
of the hot fluid or evaporation of the cold fluid occur across the heat exchanger, as in a 
boiler for example. 


Figure 2.60 Examples of what 
is meant by cross-flow heat 
exchangers (Bejan, 1993) 


Effectiveness s-NTU method of rating heat exchangers 

This is an alternative method to LMTD for working out the temperature change in a heat 
exchanger, which examines the thermal conductance, UA, as well as the capacity rates, Q, ot and 
Qold; this method introduces two dimensionless groups, the Number of heat Transfer Units: 

NTU = , UA (2.87) 

(™Qmin 

where (mCp) mm is the smaller capacity rate and the effectiveness, s: 

_ actual heat transfer rate _ 
maximum heat transfer rate q m3x 

in which 

q max = C mm AT max (2.88) 


What is the maximum possible heat transfer rate? 

This can best be considered in graphical terms as it asks the question: 
if the UA could be increased to whatever we wanted, what is the 
maximum achievable heat exchange between the two fluids? 

As the size of the exchanger increases, the fluid with the smaller 
capacity rate, which has the steeper gradient (in this case 
illustrated in Figure 2.61; this is Q 0 id), exchanges heat with the 
hot fluid until the limit when the size of the heat exchanger is 
such that it leaves at the temperature of the hot fluid. This is the 
limit of the heat transfer. The minimum fluid may be either the 
hot or the cold fluid. In this case 

tfmax = QoldQhot,m “ Qold,in) 



Figure 2.61 Demonstrating what the maximum possible 
heat transfer between two fluids in a heat exchanger is 
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Calculations 

These usually entail evaluating effectiveness, s from knowledge of the capacity rates, C, and 
NTU values, and then evaluating g actua j. No iteration is required. Expressions for NTU in terms 
of e can he derived by analysis of the rearranged LMTD heat transfer equation: 

\n(AT a /AT b ) = |(A T a - AT h )UA (2.89) 

in terms of effectiveness and NTU, by manipulation of equations 2.87—2.89. For example, a 
parallel flow gives the formula 


NTU paraUelflow 


ln[l ~ S(1 + Cmii/ Qmx)] 

1 + C^/C^ 


which can be manipulated to give effectiveness: 


(2.90) 


1 - exp[— NTU(1 + C mm /C max )] 

1 + 


(2.91) 


A formula is required for each configuration of heat exchanger. Fortunately, rather than using 
the formulae, data are presented in charts that can be read. If C m -,„/ C max = 1 , the capacity 
rates are balanced — a balanced heat exchanger. If C mi „/ C max = 0, one stream, C max , has phase 
change at nearly constant pressure, and effectively has infinite capacity rate. 


Worked example 

A heat exchanger has an overall heat transfer coefficient of 120W/m2K. 
Superheated steam enters the tubes at 250 °C and exits the exchanger at 
400 °C. Hot gases on the shell side enter at 900 °C and exchange heat with 
the steam. The capacity rate of steam is 200 000W/K and that of the gases 
is 300 000W/K. What is the effectiveness? Using the formula for NTU for a 
counter-current shell and tube, determine the required surface area of the heat 
exchanger. What is the temperature of the gases leaving the exchanger? 


NTU = 



The actual heat transfer is q = C steam X (400 — 250) = 30 MW. 
tfmin = C min (900 - 250) = 130 MW The effectiveness, s = q/q^ = 30/130 = 0.23. 
We know that NTU = UA/C min = 120 X A/ 200 000. From the formula for a single 
counter-flow tube and shell exchanger: 

, /I - 0.23 X 20 000/30 000) 

ln 1 - o 23 

NTU = — - 

1 - 20 000/30 000 

Therefore, A = 0.285 X 200 000/120 = 475 m 2 . The gases exit temperature is 
determined by: q = C gases (900 - T exit ), 30 X 106 = 300 000 X (900 -T exit ),and 
T exi t = 800 °C. 
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Knowing 


UA/C mm = NTU 


the chart gives effectiveness, which then yields q actual from knowing q max . Inspection of the 
chart in Figure 2.62(a) shows that if the area, C ITnn , and overall heat transfer coefficient are 
known, then the effectiveness of the heat exchanger is given from the experimental data 
represented by the chart. This is for two shell passes, four tube passes and multiples of them. 




Figure 2.62 Effectiveness-NTU charts for the same geometries as the LMTD illustrated in Figure 2.59 (R.A. Bowman, 

A.C. Mueller and W.M. Nagle, 1930, ‘Mean temperature difference in design’, Transactions of the ASME, vol. 12, 417-422) 


Calculation of the pressure drop 

Calculation of the pressure drop depends on detailed knowledge of the path that the fluid 
follows in flowing through the heat exchanger. Expansions, entries, and bends all need to be 
included. 


There is a pump factor that needs to be taken into account. For a liquid it is: 

^="(P,n -Eout) (2-92) 

in which r\ p is the isentropic efficiency of the pump. 


For a gas it is 



(2.93) 


Intensification of heat transfer is usually accompanied by an increase in pressure drop. The A p 
across the heat exchanger can be an issue with complex flow paths. Work must be supplied to 
deliver fluid through this pressure gradient by a pump. 
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Fouling factors 

Over time, fluids tend to leave residual traces on the surfaces that they flow over. The thin layer 
of deposits from a particular fluid flow in the heat exchanger has a conductive resistance on the 
shell (s) and tube (t) sides. This contributes to overall U s (heat transfer coefficient). 

AT 

q = — and q = U S A S LMTD (2.94) 

KA s + KA t 

to give 


M, + Mt 

Over time, the boundary surfaces corrode and acquire a scale coat, while the fluids gather 
impurities, so fouling factors (r) are used for both shell-and-tube, giving: 



Fouling factor is additive to ideal heat transfer coefficients, and the effect of the fouling depends 
on fluids involved. 

Typical fouling factors are illustrated in Table 2.5 showing the effect of various fluids. 


Temperature of heating 
medium water temperature 

Up to 1 1 5°C 
50°C or less 

1 1 5-205°C 
Above 50°C 

Water velocity 

1 m/s and less 

Over 1 m/s 

1 m/s and less 

Over 1 m/s 

Water types 





Distilled water 

0.0001 

0.0001 

0.0001 

0.0001 

Sea water 

0.0001 

0.0001 

0.0002 

0.0002 

Brackish water 

0.0004 

0.0002 

0.0005 

0.0004 

City or well water 

0.0002 

0.0002 

0.0004 

0.0004 

River water, average 

0.0005 

0.0004 

0.0007 

0.0005 

Hard water 

0.0005 

0.0005 

0.0009 

0.0009 

Treated boiler feed water 

0.0002 

0.0001 

0.0002 

0.0002 

Liquids 





Liquid gasoline, oil and 
liquefied petroleum gases 




0.0002-0.0004 

Vegetable oils 




0.0005 

Caustic solutions 




0.0004 

Refrigerants, ammonia 




0.0002 

Methanol, ethanol and 
ethylene glycol solutions 




0.0004 

Gases 





Natural gas 




0.0002-0.0004 

Acid gas 




0.0004-0.0005 

Solvent vapours 




0.0002 

Steam (non-oil bearing) 




0.0001 

Steam (oil bearing) 




0.0003-0.0004 

Compressed air 




0.0002 

Ammonia 




0.0002 


Table 2.5 Representative fouling factors [m 2 K/W] related to various fluid flows. From Chenoweth (1 990) in 
Bejan (1993). 
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Learning summary 

By the end of this section you will have learnt: 

/ heat exchangers are classified firstly as regenerators and recuperators. The recuperator, which is the 
type involving tubes carrying one fluid surrounded by another fluid at a different temperature, is the 
concern of this unit; 

/ recuperators are defined secondly by direction of flow as counter-flow or parallel-flow of the two 
fluids involved; 

/ thermal capacity rate is the specific heat capacity of a fluid multiplied by the amount of mass of the 
fluid considered, which is the mass flow rate of the fluid. It determines the temperature rise for a 
given heat input; 

/ heat transfer theory shows how an elemental approach could be taken to the analysis of a 
particular heat exchanger, but there are two methods used with overall heat transfer coefficients, 
which allow for simplified analysis in general cases; 

/ since the temperature varies throughout a heat exchanger, the logarithmic mean temperature 
difference (LMTD) is used with a known overall heat transfer coefficient to work out either the heat 
transferred for a given sized heat exchanger or the size of a heat exchanger given the heat transfer 
required. It can be shown that the LMTD is the correct mean to calculate the overall heat transfer; 

/ the collection of experimental results by Kays and London (1 984) provides a method for modifying 
the LMTD for more complex heat exchangers than the simple shell-and-tube configuration; 

/ it is important to remember that the capacity rate of each fluid determines its temperature and heat 
transfer; 

/ the effectiveness (e-NTU) method relates the known data of overall heat transfer coefficient and 
minimum of the two capacity rates in the NTU and relies on experimental data from heat transfer 
texts to relate this to the ratio of actual heat transfer to maximum possible heat transfer between 
the hottest and coldest temperatures available. Again this relies on the catalogue of data by Kays 
and London; 

/ the cost of heat exchangers is the pressure drop for a given size. This must be considered in a 
practical heat exchanger and is determined from fluid mechanics calculations; 

/ fouling often occurs in practical installations and the alteration to heat transfer coefficient must be 
considered; 


The idea of a vapour power cycle is to use a condensable vapour in a heat engine to create 
power from a source of heat. Referring to Figure 2.63 we can see that it is similar to a heat 
pump running in reverse: 


2.8 Vapour power cycles 



Figure 2.63 Schematic diagram of a simplified vapour power cycle 


119 


An Introduction to Mechanical Engineering: Part 2 


• 4-1 work is put in by a pump (at the position of the throttle in the heat pump) to drive the 
liquid-fluid into a high-pressure part of the circuit (rather than expanding into the low- 
pressure part of the circuit); 

• 1-2 significant heat is added in the high-pressure side to convert the liquid into a vapour; 

• 2—3 the vapour passes through a turbine (at the position of the compressor in the heat 
pump), driving the blades round and producing work out of the turbine shaft, and in the 
process losing pressure back to the low-pressure side of the circuit; 

• 3-4 before running around the circuit again, the vapour-fluid is cooled in a heat exchanger 
to convert it fully back into a liquid. 

The process is used for power production and requires that 

(1) the working fluid is a condensable vapour; 

(2) the power cycle consists of a series of steady flow processes. 

For the remainder of this section steam will be considered as the condensable vapour. The 
analysis is simplified by the assumption that kinetic and potential energy changes are negligible 
compared with the change in enthalpy. The steady flow energy equation for change of state 
between points is therefore 

Q + W= H 2 -H 1 

Since the purpose of the cycle is to produce usable work, it is useful to see where the work is 
coming from and where the expended energy used to create the work is being applied. The 
useful measures are thermal efficiency, work ratio, and specific steam consumption. 

Thermal efficiency 

Closed cycle plants used for power generation have a thermal efficiency of between about 35% 
and 50%, where: 


net work output w out — w in ( h 2 ~ k 3 ) — (hi — h 4 ) 

T ' lh external heat supplied q h ot h 2 — hi 

The net work output is measured in MW. It is useful to quantify the total energy output as 
electrical energy from the generator set that is mounted at the end of the turbine shaft, and this 
is often denoted as MW(e) to signify electrical power output. In practice it is very close to the 
work output from the shaft of the turbine since the efficiency of electrical generators is very 
high (usually over 99%). 

Work ratio ( r w ) 

The efficiency of the cycle is not useful on its own — we want to know how much we can 
get out of the process. Work occurs at two points in the cycle — driving the liquid-fluid up 
to the high pressure, which is an energy cost in the system, and releasing energy from the 
high pressure vapour-fluid on its passage through the turbine to deliver work to the electrical 
generator set. The ratio of these is expressed as the work ratio: 

= net work output = (h 2 - h 3 ) - (fe t - h A ) 
w gross work output h 2 — h 3 

Equation 2.96 describes how much work we get out for work we put in - there is no 
consideration of heat here. High r w values are desirable, which indicates low sensitivity of 
efficiency to irreversibilities. Modern steam plants have r w ~ 0.98, while for gas turbines cycles 
r w ~ 0.45, the major difference being due to high compression work involved with gases. 
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Specific steam consumption (SSC) 

The efficiency and work ratio do no inform us of the amount of steam delivered to the 
turbine, which indicates the size of the plant. We use the specific steam consumption (SSC): 

SSC = ( 2 . 97 ) 

W ne t 

which is usually expressed in kilograms of steam per hour per kilowatt of power produced 
(kg/hr/kW = kg/kWhr). A low SSC is good and implies the need for a smaller plant (that is 
lower capital cost). 

Cycles 

The simplified cycle shown in Figure 2.63 can be modified to improve performance on all 
three metrics by application of aspects of the second law of thermodynamics. The following 
applications are used routinely to improve power station performance and are regular parlance, 
so much so that the names of the inventors who first considered the implications of the process 
changes are synonymous with the processes. The technology applied in power stations increases 
the work produced for the size of plant and the amount of work that can be produced from the 
heat available. The theory explains the Carnot cycle, the Rankine cycle, superheating, reheat 
and regeneration. 

Carnot cycle - the most efficient thermodynamic cycle 

The Carnot cycle comprises four reversible processes: two 
isothermal and two adiabatic as shown in Figure 2.64, and these 
are: 

1— 2 isothermal expansion by adding heat, Q H . In this part of the 
cycle, heat is added from the surroundings to convert the water to 
low quality steam (i.e. high liquid droplet content or cloudy steam) 
and then to high quality steam (low liquid content), and eventually 
to saturated steam (i.e. no water content, or dry steam) by 
evaporating the liquid and thus maintaining a constant temperature 
in the steam. The heat input is denoted Q H , since the heat is added 
in the high pressure, or (relatively) hot side of the circuit 

2- 3 adiabatic expansion by passing the steam through a turbine, producing work. This assumes 
that the turbine is perfect, i.e. it has no mechanical-friction or fluid-friction losses. The steam 
temperature drops from high on the high pressure side, T h to low on the low pressure side, T L 

3— 4 isothermal compression by removing heat, Q L . Fleat is removed from the system at this point 
in the cycle to the surroundings, and the temperature is constant as steam gradually condenses as 
it travels, forming small droplets, which may condense on the pipe walls of the system. Note that 

a high water by mass content in the steam is not necessarily a high water by volume content since 
the density is high (about 1000 kg/m 3 , compared to steam at about 0.5 kg/m 3 ) 

4- 1 adiabatic compression by passing the low quality steam through a pump, until the point at 
which the steam-water mixture in the system becomes saturated water with no steam content. 
Temperature increases from r L to t h . 

The constancy of the temperature as heat is added in the high-pressure side and heat is 
removed in the low-pressure side is, thermodynamically speaking, a good thing because we 
know that the Carnot efficiency improves with increasing the difference between the hottest 
and coldest reservoirs (surroundings), and we know that if heat is exchanged in a cycle with 
more than one reservoir at different temperatures, then the efficiency falls from the maximum 
available from the highest temperature and lowest temperature reservoirs to somewhere in 
between. The only way to add heat at constant temperature is with phase change and it is 
achieved in steam by working in the mixture region. 



the temperature-entropy diagram for steam 
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The usefulness of the T-s diagram is apparent from this process as shown in Figure 2.64 in 
which it can be seen that the constant entropy processes are vertical lines, and variation from 
the ideal will be seen if the lines incline to the vertical. 

The Carnot cycle is the basic cycle to aspire to, however, since it is an impracticable cycle, and 
although the thermal efficiency is highest for a given high temperature, higher temperatures 
are achievable by working outside the mixture region. The Carnot cycle is impractical for the 
following reasons: the turbine will physically struggle to expand the vapour in the mixture 
region since at high speeds the droplets of steam that form as the quality of the steam falls will 
impact on the blades and cause erosion, which is very costly to repair; it is difficult to ensure 
that condensation reaches the exact point required for the saturated liquid at the point entering 
the compressor on the left side of the process, and the compressor work is quite high since it 
is not pumping a very dense liquid but a significandy less dense, and therefore higher specific 
volume, fluid, reducing efficiency, which decreases the work ratio. As a point of interest, if the 
cycle is reversed it becomes the Carnot refrigeration cycle. 


Worked example 

Find the SSC, work ratio and thermal efficiency of the Carnot cycle operating 
between 100 bar and 0.3 bar. 


For SSC, we require the work done to compress the fluid, and the net work done. For the 
Carnot cycle, we need the enthalpy change between 4 and 1 on the diagram in Figure 
2.64, as follows: 

%ioobar = 1,408 kj/kg 
^loobar = 3.36 kJ/kgK 

Dryness fraction at 4 is from entropy h 4 = h 4 : 

s 4 /(l — x) + s 4g x = s t 
0.944(1 - x) + 7.767x = 3.36 
* = 0.35 


Therefore enthalpy at 4 is: 

K = 0.35h g03bar + Q65h mbar 

h 4 = 0.35 X 2,625 + 0.65 X 289 = 1,106 kj/kg. 

Therefore 

W4_, = 1408 - 1106 = 302 kj/kg. 


The work out of the turbine between 2 and 3 is: 


kh = h gl00bar - h 3 

h 3 is found by first getting the dryness fraction as above for constant entropy, which is 
0.685, and then using that to get the mixture enthalpy, which is 1,889 kj/kg. Therefore 


SSC for Carnot is 


r w for Carnot is 


A h = 2725 - 1889 = 836 kj/kg. 


1 X 3600 
(836 - 302) 


6.742 kg/kWh. 



For thermal efficiency, the external heat added is required. For the Carnot cycle, this is 
(836 - 302) 

%,ioobar = 1317 kj/kg. Therefore tith, Carnot = ^317 = 40.5%. 


(Using data from tables in Rogers and Mayhew, 1995.) 
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Rankine cycle - reducing compression work 



pump work in the liquid phase, which results in 
reduced loss 


The Rankine cycle, shown on the T-s diagram in Figure 2.65, extends the condensation to the 
saturation line (process 3—4) before using a pump instead of a compressor, to increase pressure 
of liquid water (process 4—1) from the condenser pressure (typically a fraction of atmospheric 
pressure) to the boiler pressure (typically in the order of 100 bar). The mixture at outlet from 
the condenser is saturated liquid, so the feed pump work is greatly reduced since compressing 
liquid requires less work than for a liquid and vapour mixture. The work ratio is increased, 
because 1 and 4 are virtually coincident and, as a direct result of this, W in is very small. For a 
flow process, the reversible process work is: 

W= Judp 

and since v [m 3 /kg] is very nearly constant and not a function of pressure, the work is v[pi 
— p 4 ]. With v being the inverse of density, and knowing that water density is approximately 
1,000 kg/ m 3 , an estimate of the work in changing the pressure from 0.3 bar to 100 bar is 
0.001 X [100 - 0.3] X 10 5 = 9970 J/kg 

or 10 kj/kg. In the Rankine cycle, thermal efficiency is reduced, because not all heat is added 
at the higher temperature, rather a significant amount is added at a temperature lower than the 
saturation temperature, but this is unimportant with regard to the increase in work output. 

This can be illustrated by using typical enthalpy values. SSC is also lower (i.e. improved) 
due to the increased work out per kg of steam during the cycle. These effects are seen in the 
following example. 


Worked example 

Compare the SSC, work ratio and thermal efficiency of the Rankine and 
Carnot cycles operating between 100 bar and 0.3 bar. 


In the case of the Rankine cycle, the compression work is 10 kj/kg of water. Using some 
of the results from the turbine work in the Carnot calculation above: 

SSC for Rankine is 


1 X 3600/(836 - 10) = 4.891 kg/kWh 
Carnot was 6.742 kg/kWh. 


r w for Rankine is 


(836 - 10) 
836 


= 98.8% 
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For Carnot it is 64%. 

For thermal efficiency, the external heat added is required. For the Rankine cycle it is 
Voobar - %o.3kr + 10) kj/kg = 2725 - (289 + 10) 

= 2426 kj/kg. 

Therefore (836 _ 302) 

T lTH,R ankine ~ 2 426 ~~ 22% ' 

The comparison shows that Rankine is a significant improvement over Carnot for SSC 
and r w , but that the thermal efficiency is significantly affected. 


Rankine cycle with superheat 

To improve the cycle efficiency, the saturated steam leaving the boiler is superheated. Heat 
is applied at nearly constant pressure between 2 and 3 by passing the steam through pipes that 
pass through the furnace flue gases at the top of the furnace. 

It is useful to note at this point where a practical implementation is considered, that the boiling 
of water to saturated steam occurs in tubes built into the walls of the furnace that completely 
fill the wall surface area, receiving mostly radiant heat transfer from the hot flame in the 
vicinity of the burners, before rising to a steam drum, where any liquid is passed back to the 
furnace wall pipes and the saturated steam proceeds into the superheating tubes. This raises the 
steam state into the superheated region. 

Figure 2.66(a) gives some indication of the layout in the furnace. The boiler is actually in the 
walls — a large number of vertical pipes line the entire wall surface, and in these pipes the water 
is heated from state 1 to state 2 in Figure 2. 65. The superheater circuit consists of yet more 
pipes suspended above the flame of the furnace in the middle of the hot rising gases. The gases 
having a higher temperature than the steam in the pipes means that there is a temperature 
gradient between the hot gases and the steam in the pipes of a few hundred degrees, and 
significant heat transfer occurs mainly by forced convective heat transfer. 

Figure 2.66(b) shows the process diagram on the T-s chart. The use of modern materials 
allows T 3 values up to 650°C.The Carnot cycle is limited to T max «S 374.15°C (the critical 
temperature for H 2 0), and superheating surpasses this temperature considerably. Some of the 
heat is supplied at T > T sat , raising the average temperature above that of the purely Rankine 
cycle, and thus increasing the thermal efficiency. The other advantage of superheating is that 
the dryness fraction at exit from the turbine (4) is higher than in the Rankine cycle and 
consequently the turbine suffers less erosion by droplet impact. 



Figure 2.66 Schematic of the Rankine cycle with a 


Thermodynamics 


The real process in the turbine becomes important at this point as indicated in Figure 2.66, and 
the isentropic (constant entropy, reversible adiabatic) process 3-4 suffers an increase in entropy 
due to the second law and arrives at the same pressure line as 4, but with increased entropy and 
hence dryness fraction. The dotted line shows this irreversible process. 


Worked example 

Calculate r w , r| XH and SSC for a Rankine-superheat cycle working between 100 
bar and 0.3 bar, with 139°C of superheat. 


The calculation of the liquid work is the same as in the previous example. At 100 bar 
T sat = 311°C, therefore superheated temperature is 450°C.The isentropic work out of 
the turbine requires enthalpy in: 


^100bar,450°C — 3241 kj/kg 


and the enthalpy at exhaust. 

Calculate the exhaust enthalpy using the mixture fraction at constant entropy. Entropy 
at 100 bar and 450°C is 6.419kJ/kgK. Using hj and h„ at 0.3bar as 289 and 2625 kj/kg 
respectively, results in dryness fraction: 


a 


6.149 = 0.944(1 - x) + 7.767x 
x = 0.762 


1 

i 


and hence exhaust enthalpy: 


h 3 = 2625x + 289(1 - x) 
h 3 = 2069 kj/kg 


hence work out is 3,241 — 2,069 = 1172 kj/kg. 

(1172-10) 


'w = 1172 = 99A% 

External heat supplied is 

fci00bar,450°c - (289 + 10) = 3241 - 299 = 2942 kj/kg 
(1172-10) 


■n™ = - 

SSC = 7 


2942 

3600 


= 39.5%. 


-= 3.098 kg/kWh. 


Use of reheaters 

A ‘reheater’ supplies heat at a point part way through the turbine expansion (between turbines). 
The reheater is a tube bank that passes steam back through the furnace flue gases in the same 
way as superheaters, as shown in Figure 2.67. This avoids turbine expansion into low-dryness 
fraction conditions, which cause high turbine erosion by droplet impact. 

The specific steam consumption (SSC) is reduced, hence a smaller power station is required. 
Figure 2.67(b) shows the effect of the reheat, which is similar to the superheater, but on a lower 
pressure line. 
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Figure 2.67 Schematic diagram of the process with a reheater circuit 


The Mollier chart or enthalpy-entropy diagram for steam 

This chart, a section of which is shown in Figure 2.68, is used to find the state of steam at entry 
to and exit from turbines. It is useful because the ideal process is isentropic - constant entropy - 
which means there is a vertical line from one pressure and temperature state to a lower, known 
pressure. When the Mollier diagram is used with isentropic efficiency, the true exit state 
from the turbine can be easily determined by calculating the actual exit enthalpy and hence 
temperature. The change in specific enthalpy that is revealed by this is used to find the actual 
work produced by the turbines in the steam cycle. The chart is a quick and reasonably accurate 
method for finding the change in properties, avoiding the need for interpolating values in the 
tables of steam data, which is the alternative method. 

Use of feed heaters - regenerative cycle 

Feed heaters are used to reduce the external heat supplied (i.e. the heat drawn into the circuit 
external to the steam pipes, that is heat taken out of the furnace) at temperatures below T max . 
This is known as regenerative heating. They therefore increase the thermal efficiency. The 
reduction is achieved by bleeding off steam part way through the turbine expansion and using 
it to heat the feed water flow to the boiler. 

There are costs associated with this mode of operation since turbine work is reduced and 
specific steam consumption is increased. The plant is more complex (more expensive). 

There is an advantage in regenerative heating in that the LP turbine doesn’t have to be so big 
because it handles less steam. 

Two types of feed heaters are used: 

- open in which bleed steam and feed water are mixed irreversibly and adiabatically in a 
constant pressure process; 

— closed in which a recuperative heat exchanger is used with no mixing of the separate 
streams. 

In both cases, mass flow conservation and the steady flow energy equation (SFEE) are 
applicable, that is, inlet and outlet conditions are related by mass flow continuity and SFEE. 

(Note: recuperative is analogous to a recuperative heat exchanger — where the fluids exchange 
heat through a wall with no mixing.) 
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Figure 2.68 Section of the Mollier enthalpy vs. entropy diagram for steam process analysis for thermal energy systems. 
(PROATES® is a registered trademark of E.ON Engineering Ltd. Reproduced with the permission of E.ON) 
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Open feed heater 

The mass flow of steam is split after the HP part of the turbine 
so that only a proportion of it goes on to the low pressure (LP) 
turbine. 

The energy equation in the feed heater, assuming a perfectly 
insulated (i.e. adiabatic) chamber and referring to the state 
points indicated in Figure 2.69, becomes 

ymh 4 + (1 — y)mh 7 = mh 8 

in which y is the diverted proportion of the total steam flow 
rate, m, and h is specific enthalpy at the indicated locations 4, 

7 and 8 in the circuit. States 4 and 7 mix irreversibly, but at 
constant pressure, to produce a fluid at state 8, which is usually 
in the saturated liquid state. The liquid reaching the boiler 
in the saturated state then has the requirement of receiving 
sufficient heat to change phase rather than having to heat up to 
saturation first. Since the pressure is below the boiler pressure, 
an extra feed pump is needed after the open feed heater. 

P\= Pi = Ps 
tn 4 + m 7 = w 8 

The principle of energy conservation gives 
{mh) 4 + ( mh) 7 = ( mh) s 

The assumption of saturated liquid at the point before the 5 (kJ/kgK) 

pump is usually a reasonably good one to calculate the required Fjgure 2 , 69 A steam circuit with open feed heater 

bleed flow, m 4 . It can be shown that, for optimum efficiency, 

the pressure for bleeding is p $at for the temperature halfway 

between the average boiler and condenser temperatures, but the proof is beyond the scope 
of this section. With the assumption of reaching saturation temperature, the process diagram 
appears as shown schematically in Figure 2.69. Note that this is not a true process diagram 
because the mass carried round each part is not the same. The feed pump between 6 and 7 
increases pressure almost at constant entropy, before the liquid reaches the feed heater chamber, 
where it mixes with the wet steam, which has been bled out of the high pressure turbine, and 
the combined water leaves at saturated condition before being pumped up to the boiler steam 
pressure at l.When calculating the work out it is important to remember that the low pressure 
turbine does not have the full flow rate of steam from the boiler but a quantity less by the bled 
steam. 

Closed feed heater. 

In this case, as indicated in Figure 2.70, some of the expanding steam is bled from the HP 
turbine and passed through the feed heater, which raises the feed water temperature partially 
towards the saturation temperature at p x . The bled steam at 4 is cooled by heat exchange to 
state 8 and then throttled down to the condenser pressure. During the throttling process 
the expansion may lead to condensed water flashing off as steam at the lower pressure, for 
which a flash chamber is provided. The feed water temperature is raised from T 7 to 7\ 
during the process of internal heat transfer. Note that 1 is still liquid (subcooled liquid). 

State 8 is condensed liquid. The throttling process can form some steam, which can lead to 
erosion. 

Flash chambers are used to avert erosion by slowing down the large volume of steam. There is 
no mixing between streams. 


High pressure Low pressure 

turbine turbine 
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High pressure Low pressure 




Figure 2.70 Schematic of regenerative feed water heating using a closed feed heater 


The energy equation gives (assuming adiabatic conditions): 

(mh) 4 - ( mh) s = ( mh) x - ( mh) 7 
Since enthalpy is constant from 8 to 9: 

(mh) 8 = ( mh) 9 

Note: as before, this is not a true process diagram because the mass carried round each part is 
not the same. The inset picture in Figure 2.70 shows the detail at point 8 — the bleed steam 
condenses as it heats up the feed water from state 7 to state 1 before it enters the boiler. The 
spent bleed steam then goes through a throtde to the pressure of the condenser and joins the 
feed water. The throttle process is indicated by a dashed line since it is an irreversible process. 


Cycles with power and process steam/heat as useful 
outputs - combined heat and power 


On chemical process sites, there is often the requirement for steam and 
heat to be supplied in addition to there being a need for electrical power. 
Steam plant can be designed to meet all these demands. Steam is required 
at a much lower temperature than power steam. It would be wasteful to 
have a heating-only plant. 

Efficiency is redefined as, for example: 

net work + process heat 
^ heat supplied 



Efficiencies are higher than for conventional plant (70% typical) since the 
heat used for the process can be included in the benefit of the system. 

It is convenient to use plant for power and then extract heat from the 
condenser. The condenser is maintained at a relatively high pressure in 
order that T SAX = T RE Q UIRED .The turbine exhausting to the relatively 
high pressure condenser is called a ‘back pressure turbine’. This is practical 
when heat and power are fairly steady and well matched. When higher 
power is required, a ‘pass-out turbine’ can be used. This bleeds steam at an 
intermediate pressure to a heat exchanger before being fed back to the 
boiler. The amount of bled steam can be controlled. 
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Learning summary 

By the end of this section you will have learnt: 

/ a vapour power cycle operates on a condensable fluid in a closed cycle, receiving heat from a 
hot reservoir, usually a boiler, to evaporate the fluid before expanding through a turbine to a lower 
pressure in order to produce work; it is then condensed to remove heat at a low pressure to a cold 
reservoir. A pump then increases the pressure to the high pressure side of the circuit prior to the 
heating part of the cycle; 

/ enthalpy changes between state points on the cycle are used to calculate heat and work from the 
first law in the form of the steady flow energy equation; 

/ since vapour power cycles are widely used for power production, there are measures of the 
effectiveness of employment of the energy input to obtained output power. Thermal efficiency is the 
ratio of work output to externally supplied heat, i.e. the heat supplied from the hot reservoir. Work 
ratio is the ratio of net work (work output less work in) to work output, and gives an indication of the 
loss to compression. In the case of steam vapour power cycles, the specific steam consumption is the 
mass flow rate of steam per unit of power output, measured in kg per kW-hour; 

/ the basic cycle has been improved by altering it to improve the effectiveness. The basic cycle is 
the Carnot cycle, which uses evaporation between saturated liquid and saturated vapour and which 
has expansion and compression within the vapour-liquid mixture region. This is the most thermally 
efficient cycle given any maximum and minimum temperature available, and is based on the idea 
of the Carnot efficiency. The Rankine cycle is an improvement to the Carnot cycle, by simplifying the 
condensation part of the cycle such that all fluid is in the saturated liquid state, and the pumping 
work is thus significantly reduced. The average upper temperature can be increased by employing a 
superheat part of the circuit after initial boiling, thus increasing the overall thermal efficiency; this also 
improves the turbine operation by reducing expansion into the mixture region. Reheat in the circuit is 
included after the initial turbine expansion, in order to avoid entering the mixture region significantly 
and is used in coal-fired power stations; 

/ the Mollier chart or h^s diagram is a convenient representation of the processes in the vapour power 
cycle since it directly yields enthalpy changes, which are the heat and work exchanges with the 
working fluid. The process in the turbine is close to isentropic, and it is a simple exercise to compare 
the isentropic performance with the real performance by use of the isentropic efficiency; 

/ feed heating is often employed, using some of the heat in the steam to heat the water being fed from 
the condenser to the pump. This improves the thermal efficiency by reducing the heat required from 
the boiler; 

/ feed heaters can either mix the steam with the feed water, in an open feed heater or pass the fluids 
through a recuperator, in which the fluids are separate but exchange heat, in a closed feed heater; 

/ combined heat and power is used where a quantity of heat is required for chemical processing in a 
factory, or where domestic heating is required for long periods of time. The thermal efficiency is 
artificially improved by including some of the heat lost as useful output since it is put to good use. In 
this case, steam is taken at higher temperature out of the final turbine, at a point where there is still 
useful heat in the steam. 
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2.9 Reciprocating internal combustion 
engines 

Internal combustion engines were developed in the 19th century, producing petroleum- and 
diesel-fuelled engines that would subsequendy find their most common and well-known 
application in the automotive industry, for power units for motor vehicles, as well as for small- 
scale power generation. In this section, an overview of reciprocating engines is given. 



The reciprocating engine mechanism, shown schematically in Figure 2.73, consists of a piston 
that moves in a cylinder and forms a movable gas-tight plug, a connecting-rod and a crankshaft, 
similar to the construction of the reciprocating compressor. If the engine has more than one 
cylinder then the cylinders and pistons are identical, and all the connecting rods are fastened 
to a common crankshaft. The angular positions of the crank pins are such that the cylinders 
contribute their power strokes in a selected and regular sequence. By means of this arrangement 
the reciprocating motion of the piston is converted to a rotary motion at the crankshaft. There 
are many types and arrangements of engines and their classification defines them as described 
in the following. 



• air standard cycles 

• compress and expand isentropically 

Figure 2.74 Air standard cycles representing the Otto and Diesel cycles 


Otto and Diesel cycles 

Petrol and combustible gases have high volatility and are appropriate for spark ignition 
(SI) engines and a thermodynamic cycle based on the Otto cycle. Petrol is a complex mixture 
of distilled organic oil, the closest chemical to which is octane, or C g H 18 , which leads to the 
definition of the octane number, defining the quality of the fuel. The combustion is started 
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by spark ignition and allows for controlling timing of fuel detonation. The air standard Otto 
cycle is shown in Figure 2.74(a). An air standard cycle considers the processes in an ideal 
state, in which the heat is supplied and removed externally (rather than generated within the 
cylinder as it is in practice) and a single working fluid in a closed cycle is considered. The cycle 
consists ofisentropic compression (1-2) assuming no heat transfer through the cylinder 
walls and a rapid process; constant volume heating (2—3); the piston moves down and 
isentropic expansion occurs (3— 4); heat removed (4—1), and the cycle repeats. The power 
out of the cycle is the area in the cycle loop. 

Diesel and fuel oils have relatively low volatility and are appropriate for compression 
ignition (Cl) and a thermodynamic cycle based on the Diesel cycle. Since the fuel is a 
complex mixture of distilled organic oil, the chemical composition is not precisely definable, 
but a close approximation is cetane, or C 16 H 34 , and this is the ideal diesel fuel leading to the 
definition of cetane number defining the quality of the fuel. The combustion is started by 
compression ignition, which works by increasing the pressure to such a high level that the 
temperature is significantly raised, sufficient to commence a rapid deflagration or combustion, 
as compared to the explosive reaction occurring with more volatile fuels. The air standard 
Diesel cycle is illustrated in Figure 2.74(b). It consists ofisentropic compression (1—2); 
constant pressure heat addition (2— 3); isentropic expansion (3—4) and constant volume 
heat removal (4—1). 

The practical implementation of these cycles are the spark ignition engine, in which the air and 
fuel are mixed before compression and the compression ignition engine, in which the air only 
is compressed, and the fuel is injected into the compressed air that is then at a sufficiently high 
temperature to initiate spontaneous combustion. 


High pressure injector 



Figure 2.75 Schematic diagrams of engine types: (a) Spark ignition, four-stroke 
engine, (b) compression ignition, four-stroke engine, (c) spark ignition, two-stroke 
engine with BDC position indicated to show clearance of charge delivery and 
exhaust ports 


Cycle processes arrangement 

Figure 2.75 shows the practical differences between three engine types, which between them 
represent the common engine cycles used. The stroke of the piston is the distance it moves 
from the position most extreme from the crankshaft to that nearest it. This takes place over half 
a revolution of the crankshaft. In petrol engine practice, the extreme positions of the piston are 
referred to as top dead centre (TDC), and bottom dead centre (BDC). In oil-engine practice 
they are referred to as outer dead centre (ODC) and inner dead centre (IDC) respectively. 

An engine that requires four strokes of the piston (that is two revolutions of the crankshaft) 
to complete its cycle is called a four-stroke cycle engine. An engine that requires only two 
strokes of the piston (that is one crankshaft revolution) is called a two-stroke cycle engine. In 
all reciprocating internal combustion (IC) engines the gases are induced into, and exhausted 
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from, the cylinder through ports, the opening and closing of which are related to the piston 
position. In a two-stroke engine (Figure 2.75(c)) the ports can he opened or closed by the 
piston itself, hut in the four-stroke engine a separate shaft, called the camshaft, is required; this 
is driven from the crankshaft through a two-to-one speed reduction. The two-stroke engine 
has a piston head that is shaped to ensure that the charge delivery is directed upward into 
the cylinder away from the exhaust port to prevent a short circuit to the exhaust. The four- 
stroke petrol engine requires inlet and outlet valves. The diesel engine requires a high pressure 
injection port, and outlet valves. There may be a number of valves and ports in each cylinder to 
improve fuel distribution and combustion. 


Spark ignition engines are quantity governed. Air and fuel are mixed outside the cylinder, 
and the quantity of mixture induced is controlled by the throttle plate position. The throttle 
plate controls the air induced by restricting the inlet pipe, which may deliver to a distribution 
manifold or directly to each individual cylinder. Diesel engine are quality governed. They 
have minimum restriction to air flow into the engine (no throttle plate). Fuel is injected 
directly into the cylinder, under pressure, towards the end of the compression stroke. The 
amount of fuel injected controls the power output. 

Combustion initiation 

In the case of the spark ignition engine the air-to-fuel mixture is ignited by spark, 
typically at an angle of 10—40° before top dead centre (TDC) during the compression stroke. 
The compression ratio is approximately 8:1, such that at end of compression the gases in the 
piston are at eight times atmospheric pressure. The mixture ratio must be near stoichiometric 
(—14.5:1 by mass) for ignition by spark to occur. Current engines run at stoichiometric under 
most conditions, for efficiency of the emissions control system. The air— fuel ratio affects the 
power and economy — slightly richer causes increased power, slightly leaner causes improved 
fuel economy. A throttle or restriction on the air flow adjusts the air— fuel ratio. 

In the case of the compression ignition engine fuel injection takes place near the end of 
the compression stroke. Combustion begins, as the fuel mixes with the air in the cylinder, 
by self-ignition when compression heating raises the mixture temperature to 800-900K. 

The compression ratio in a Cl engine is greater than 12:1 in order to achieve the required 
temperature. These engines normally run at air-fuel ratios (AFR) in the range 20:1 to 25:1. 
They can run out to approximately 40:1 AFR. The air flow is not restricted or otherwise 
controlled upstream, the mixture being control by injection of the fuel at an adjustable rate. 


Control of power output 


p 



Spark ignition engine 


Referring to Figure 2.76, the characteristics of a real SI engine cycle can be 
identified as follows: 


1—2 Induction stroke. Air/ fuel mixture is drawn into the cylinder through 
the intake valve. The cylinder pressure is less than atmospheric due to losses 
across the valve, the intake manifold and the throttle plate. 


2- 3 Compression stroke. The mixture is compressed, and ignited by 
spark before TDC. Combustion proceeds at near constant volume at 
approximately 50° of crank rotation. Peak cylinder pressure occurs just after 
TDC, and is typically in the order of 25 bar for the wide open throttle 
condition (WOT). 

3— 4 Power stroke. The hot and high pressure combustion products do 
work on the piston, giving useful work output. 


Figure 2.76 Spark ignition cycle on the 
p-V diagram 


TDC 


BDC 
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4-1 Exhaust stroke. The energy-depleted combustion products 
are expelled through the exhaust port into the exhaust manifold. 
Cylinder pressure is slightly above atmospheric, due to the 
restrictions on the gas flow of the valves and pipes downstream from 
the engine. Notice that energy is lost when the exhaust starts as a 
free expansion from a pressure significantly higher than atmospheric. 

Overall, the cycle is distinctly different from the ideal Otto cycle, 
mainly because of the combustion process used to produce the 
heat internal to the closed volume, and the need to replenish 
the volume gases. The process is also distinctly different from the 
standard air Otto cycle. 


Diesel engine 

1- 2 Induction stroke. Air is induced into the cylinder - 
no fuel is present in the air at this stage. 

2— 3 Compression. The air only is compressed and fuel injection 
begins just before ODC and continues for about 20-30° of crank rotation. The injection is 
done at very high pressure in order to form a fine and quick atomized spray. The rail pressure 
has increased to the point where most efficient diesel engines now have up to 2000 bar. The 
combustion duration is longer than for an SI engine because it is nearer to constant pressure 
than constant volume. The rate of burn is controlled by the rate at which fuel evaporates and 
mixes with air in the cylinder. Peak cylinder pressure is in the order of 40 bar at full load. 

3- 4 Power stroke. The combusting mixture expands against the load applied to the engine. 

4— 1 Exhaust stroke. A stroke to remove the spent gases. Again notice the immediate loss of 
pressure at exhaust. 

Notice that the cycle is again distinctly different from the ideal Diesel cycle due to the real 
combustion processes used to provide the heat and due to the induction and exhaust strokes. 

Performance assessment 

For spark ignition engines (and similarly for diesels), the main concerns are fuel economy 
at idle and part load, wide open throttle power output, and emission levels. Also important, 
but less easy to quantify, are the characteristics of the engine response to changes in demand 
(throttle opening or closing). 

(i) Indicated power or ip: Rate of work done on the piston by the gas as determined from 
the p-V diagram. The top loop of the diagram is the power loop (positive work done by 
gases); the lower loop is the pumping loop (negative work done by gases). The indicated 
power is given by: 

ip = (imep) X LAN [W] 

where L = stroke, A = piston area, N = cycles/sec, that is, (engine speed/2 for four-stroke 
engine), imep is the indicated mean effective pressure. From the diagram: 

power loop area + pumping loop area 
lme P diagram base length equivalent to swept volume 

(imep values for spark ignition engines are ~5 bar at high load conditions). 

The above is sometimes referred to as the net imep. 

The gross imep is given by neglecting the pumping loop area. 

(ii) Brake power or bp: Measured power output, Tco, where T = brake torque, oj = engine 
speed (rad/s) 



Figure 2.77 Compression ignition cycle on the p-V 
diagram 
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(iii) Mechanical efficiency: T) m = j nc p cated p ower This is typically ~90% at full load. Losses 
are due to friction in the valve drive train, piston friction, bearings etc. 

m f 

(iv) Brake specific fuel consumption: bsfc. = — ; , where mr is the fuel mass flow 

' r brake power J 


This is a measure of engine efficiency. It is more commonly used than brake thermal 

^ . Brake Power 

efficiency, T, m = Indlcated Power 

where Q net is the lower calorific value of the fuel. (■% is around 35% for modern engines). 
Q net = A(7 0 


Volume induced/ cycle 

(v) Volumetric efficiency: r\ v = Swept Volume where the volume induced is 

referred to the conditions at inlet to the intake manifold. (WOT (wide open throttle) r\ v is 
typically >80%). 


Compression ratio and uncontrolled combustion 

Thermal efficiency increases with increasing compression ratio (r): 

L max Lb DC L clearance + Ldj sp i acet j 

r= K^n = VT^ = Clearance 

This reflects the increase in the upper temperature of the cycle, which increases thermal 
efficiency (recall that T) Carnot = 1 — ^, MIN ) . In addition, there is less residual gas left in the 

cylinder at the end of the exhaust stroke to dilute the fresh change for the next cycle. Spark 
ignition engines have rvalues in the range 9— 10. These are unlikely to increase dramatically for 
the following reasons: 

(1) Production tolerances. High r values mean larger variations from build-to-build for 
given tolerances on dimensions. 

(2) Spontaneous ignition. If the r value is increased progressively, temperatures during 
compression will increase, and eventually the charge will start to burn spontaneously before 
spark ignition. 

(3) Knock. This is a problem even on current engine designs, and is more likely to occur 
when r is increased. This is an uncontrolled combustion phenomenon. Combustion starts 
normally by spark ignition but unburnt gas ahead of the flame front is compressed. If 
the unburnt gas temperature is raised sufficiently it will self-ignite as in (2), producing a 
characteristic knocking sound. 

In some circumstances, the flame front is accelerated and propagates across the combustion 
chamber with a shock wave. This detonation wave is reflected back and forth giving rise to 
high frequency noise. Knock phenomena can cause serious overheating and loss of efficiency. 

(4) Pre-ignition. One effect of knock is to produce local hot spots on the combustion 
chamber wall that can initiate combustion before the spark on following cycles. Similarly, 
deposits on spark plugs and walls can also produce hot spots. 

In Cl engines, r is higher, but similar issues occur, due to the nature of the combustion. There is 
initial delay as fuel and air mix, followed by initial spread of the flame, and finally by combustion 
as the fuel is fully injected. Diesel knock may occur if the initial stages occur too rapidly. 


Spark timing 

In SI engines, maximum work output is achieved when combustion occurs atTDC, at constant 
volume. In practice, this is not achieved. The best spark timing gives peak pressures, which 
occur at approximately 15° after TDC. To achieve this, spark ignition occurs before TDC. 
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At a given nominal load, speed and AFR (air to fuel ratio), the optimum timing is normally 
taken as the ‘minimum advance for best torque’ (MBT timing). Usually this is determined 
experimentally, and timing requirements mapped out as a calibration for engines of a given 
design. Flame speeds are approximately proportional to engine speed, so although engine speed 
varies by an order of magnitude between idle and maximum engine speed, the duration of 
combustion requires a similar crank angle interval. Even so, timing is advanced from about 10° 
before TDC at idle and 30° before TDC at high speed, full load. 

In addition, timing is adjusted to compensate for changes in load.Timing is advanced under 
light load conditions by up to about 20°. This compensates for charge dilution by residual 
combustion products, and low cylinder pressures and temperatures, which are not conducive to 
good combustion. 

Modern electronic ignition systems perform adjustments electronically, but still use engine 
speed and manifold vacuum (or air flow rate) to identify timing requirements. 


Fuelling systems 


On spark ignition engines, fuel injection systems have largely replaced carburettors, although 
the basic function is the same. The carburettor meters fuel into the throat of a venturi, such 
that the AFR is approximately constant for all running conditions. The air mass flow rate 
through the venturi depresses the throat gauge pressure, and fuel flow has the same functional 
dependence on this pressure. This is because, as the constriction occurs, the air flow speeds 
up and the static pressure drops correspondingly, as seen in Part 1 in Section 3.3 on venturis. 
Mixture strength is increased at idle (to compensate for poor combustion conditions due to 
residual dilution in particular) and at wide open throttle (to give maximum power output, 
which is then limited by the availability of air). 

Current fuel injection systems are multipoint (most expensive) or single-point (still more 
expensive than a carburettor). In multipoint systems, fuel is injected into each tract of the 
intake manifold near to the intake valve. One fuel injector per cylinder is required. Single-point 
system inject fuel into the air flow before it is split into separate routes to individual cylinders. 


Multipoint systems (either electronic, EFI, or mechanical, MFI) are most common. Fuel is 
injected in discrete shots lasting approximately 10 msec. Fuel supply pressure (3 bar) is much 
lower than diesel fuel injection systems, and the injection timing is not critical. For diesel 
engines, fuel injection is inherently necessary. Each cylinder has an injector that fires fuel into 
the combustion chamber directly (direct injection diesels) or into a prechamber connected to 
the combustion chamber (indirect injection diesels). The injection system is sequential (each 
cylinder receives the fuel charge at the same point in its cycle). Because the injection is into the 
cylinder, late in the compression stroke, the fuel supply pressure is much 
higher (200 bar and now much higher pressures are available) than 
for spark ignition engine systems. The injection spray characteristics 
(droplet size, distribution, direction etc.) are also more critical since this 
directly influences the way in which combustion progresses. 

Supercharging j* 

The loss in the exhaust due to the sudden expansion when 
exhaust commences, can be used to benefit the charging part of 
the cycle. This is done generally on diesel engines to very useful 
effect in improving engine efficiency. The exhaust gases are 
passed over a turbine, which is directly linked to a compressor 
that delivers the air to the inlet manifold. The compressor turns 
at speeds in the order of tens of thousands of rpmThis is called a 
turbocharger, and the basic idea is indicated on the p- V diagram 
in Figure 2.78. It improves the volumetric efficiency since the ai 
in to the cylinder is at a higher pressure. 



Figure 2.78 Principle of the use of a turbocharger to 
improve volumetric efficiency 
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Learning summary 

By the end of this section you will have learnt: 

/ internal combustion engines are either operated on the Otto cycle, in which case they work with 
petrol and are called spark ignition engines, or on the Diesel cycle, in which case they work with 
diesel fuel and are called compression ignition engines; 

/ diesel fuel may be mainly considered as cetane, C 16 H 34 . Petrol fuel may be mainly considered to 
be octane, C 8 H 10 ; 

/ both cycles are based on the ideal standard air cycle versions of the cycle, which assume external 
heat supply and ideally enacted processes. In reality the cycles vary from this, mainly due to the 
nature of heat addition by internal combustion; 

/ SI engines have air-fuel ratios of approximately 14.5:1 (near stoichiometric) and a compression 
ratio of approximately 8. They are quantity governed, in that the rate of air flow is adjusted to alter 
power and economy; 

/ Cl engines have air-fuel ratios between 20:1 and 25:1 and compression ratio in excess of 12. 

They are quality governed, in that the air flow is unrestricted, and the injection of fuel controls the 
power and economy; 

/ there are several measures of engine performance: indicated power is the power directly from the 
p-V diagram; brake power is the measured power at the engine shaft; mechanical efficiency is the 
ratio of brake power to indicated power; brake specific fuel consumption, which is the fuel required 
for a specific brake power; volumetric efficiency, which is volume induced / swept volume, similar 
to that for air compressors; 

/ the compression ratio affects the combustion performance and can lead to poor combustion and 
hence poor mechanical behaviour if handled badly. In particular, it can lead to a noisy combustion 
behaviour known as knock or diesel knock; 

/ spark timing is important for SI engines in order to provide the maximum pressure at the right 
moment in the cycle. The spark occurs a few degrees in advance of top dead centre position in 
order to create maximum pressure at top dead centre; 

/ fuelling systems for SI engines rely on injection of a premixed air and fuel mixture. Traditionally, 
carburretors were used, but recently these have been superceded by fuel injection. For Cl engines, 
injection is necessary for the cycle to work, and the improvements have come from injection 
strategies due to electronic control and higher pressures available for fuel injection; 

/ the exhaust gases are ejected significantly above atmospheric pressure. This expansion can 
be used in a turbocharger to drive a turbine, which in turn drives a compressor at a very high 
rotational speed. The compressor delivers air to the cylinder significantly above atmospheric 
pressure, thus increasing the volumetric efficiency. 
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UNIT OVERVIEW 

■ Introduction 

■ Combined loading 

■ Yield criteria 

■ Deflection of beams 

■ Elastic-plastic deformations 

■ Elastic instability 

■ Shear stresses in beams 

■ Thick cylinders 

■ Asymmetrical bending 

■ Strain energy 

■ Fatigue 

■ Fracture mechanics 

■ Thermal stresses 


3.1 Introduction 

An Introduction to Mechanical Engineering: Part l covered the basic principles of solid mechanics 
(Unit 1) including basic mechanics and design analysis, stress and strain and the analysis of 
simple engineering loading situations such as uniaxial loading, beam bending, multi-axial 
stresses and torsion. This basic introduction assumed that all materials behaved in a linear elastic 
way and focused on estabHshing fundamental principles and simplified methods of analysis to 
solve for stresses and strains within components and structures. 

In this unit, the application of solid mechanics to engineering problems is taken to a greater 
depth. Further elastic analysis is considered including such aspects as combined loading applied 
to components; further bending analyses, such as shear stresses in beams; the calculation of 
bending deflections; and the bending of beams with asymmetric sections. Powerful methods 
of analysing elastic deformations in more complex shaped structures using the concept of 
stored strain energy is also included. The analysis of more detailed stress distributions, in thick 
cylinders for instance, is also explained. The purpose of these further elastic analyses is to 
determine stresses, strains and deformations under more complex loading and where geometry 
varies in a more complex way. 

Stresses and strains are calculated for a reason: to design components fit for purpose. In this 
sense, it is important that we consider how components might fail under loading. The unit 
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therefore also includes several sections on failure, including yield criteria to assess when failure 
might occur, analysis beyond the elastic region, where elastic-plastic behaviour becomes 
important and the analysis of particular types of failure other than yielding, including brittle 
fracture and fatigue (under cyclic loading). We learn how to analyse components under 
such conditions and how to design against such failures. Finally, failure of components is not 
necessarily restricted to excessive mechanical stresses but may be caused by other factors such 
as temperature or excessive deformations. The unit therefore also covers the analysis of thermal 
stresses and situations where elastic instability — buclding - might occur. 

All of the above topics are covered in depth by firsdy introducing the basic concepts and 
theory, secondly developing standard methods for analysis and finally illustrating these methods 
with many worked examples. At the end of the unit students will have learnt a wealth of easily 
usable analytical methods for solving many practical engineering problems. 

3.2 Combined loading 

Many engineering problems can be analysed as simple loading situations e.g. uniaxial loading, 
beam bending, torsion etc. However, it is also very common in the real world for engineering 
components and structures to be subjected to several loads simultaneously. This is a combined 
loading situation and can be analysed by superposing the effects of the individual loads. 

Mohr’s circle - recap 

Mohr’s circle for plane stress was introduced in Introduction to Mechanical Engineering: Part i as 
a useful graphical technique for analysing plane stresses acting on an element in a material or 
structure. For combined loading situations, it is common to reduce the problem to such a plane 
stress problem and analyse using Mohr’s circle. The analysis will give the principal stresses, the 
maximum shear stresses and the angles of the principal planes for the element. Figure 3.1 shows 
a shaft subjected to combined loading of a torque, T, and a compressive axial load, P. Let us 
assume that such loading gives rise to an axial stress of — 12 MPa (i.e. compressive) and a shear 
stress of —6 MPa (i.e. causes the element to rotate clockwise) acting on a surface element as 
shown in the figure. The Mohr’s circle analysis is then as follows: 

The known stresses on the element are: 
ct x = -12 MPa 
dy — 0 MPa 
T xy = -6 MPa 
T yx = 6 MPa 

Figure 3.2 shows Mohr’s circle for this stress system. To 
draw the circle, firsdy draw the point B which represents 
stresses on the x-plane (coordinates: —12, —6). Next 
draw point E which represents stresses on the y-plane 
(coordinates: 0, +6). Join the two points with the line 
BE that intersects the x-axis at the centre of the circle, C. 

The circle can now be drawn and the following quantities 
measured: 

(Ti = 2.5 MPa 
u 2 = -14.5 MPa 
T max = 8.5 MPa 
26 = 45° 

On the element, the angle of the principal plane (PI) from the y-plane is 0 = 22.5° 
anticlockwise as shown in Figure 3.2. 





6MI>a4 — 

0 _J2 MPa 

► 6 MPa 

Figure 3.1 Shaft subjected to combined torque and 
compressive axial loading 


An Introduction to Mechanical Engineering: Part 2 



Figure 3.2 Mohr’s circle for combined torsion and compression 


Alternatively, the important parameters in the circle can be calculated analytically as follows: 
The centre of the circle is given by C = (cr x + <J y )/2 = —6 
o\, — cr v \ 2 


The radius of the circle is given by R = ^ J + T xy 2 =8.5 


The principal stresses are: 

(Ti = C + R = 2.5 MPa 
d 2 = C-R = -14.5 MPa 
T max = R = 8.5 MPa 


The angle of the principal planes: 


20 = 



= 1 


20 = 45° 
0 = 22.5° 


If the analytical approach is taken (which does give the more accurate results), then it is always 
advisable to sketch Mohr’s circle in order to gain a clear understanding of the orientation of the 
principal planes and the maximum shear planes with respect to the x- or y-planes. 


Superposition of combined loads 

The principal of superposition states that: 

[The total effect of combined] _ y^TThe effects of the individual j 
[loads applied to a body J ^ [loads applied separately 

Thus, when a body or structure is subjected to a combination of different types of loading 
simultaneously, we can consider separately the effects of each load on the local stresses that act 
on an element. Stresses on the element can then be summed to determine the effect of the 
combined loading. A number of combined loading examples can be used to illustrate: 

Combined bending and axial loading 

Figure 3.3 shows a beam carrying a uniformly distributed load (UDL) along its span, while 
simultaneously being subjected to an axial compressive force, F. The figure shows how the 
effect of the combined loading on the stress distribution through the thickness of the beam 
at the centre of its span is determined. The effect of the UDL and the axial force are obtained 
separately and then summed to give the combined stress distribution in the beam. The 
symmetrical bending stress distribution about the neutral axis is essentially skewed to more 
compression by the effect of the axial stress. 
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UDL 


^ gQcioao^ | | % Ly|| | . | = | 



Bending 
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distribution 


Figure 3.3 Combined bending and axial loading 


Combined bending and torsion 

Figure 3.4 shows a similar beam to the above example, except now the beam carries a torque 
instead of the axial load. This loading situation is typical of a shaft with self- weight (UDL) 
transmitting a torque. In this case, the beam cross section can be assumed to he a solid circle 
with diameter (I. The stresses at the centre of the span, at the bottom surface of the beam, are 
given by the usual bending and torsion equations as follows. 

Arising from the UDL, bending stress (cr B ): 


My 
CT B = ~j~ 

where y = d/2 

Arising from the torque, torsional shear stress (t) 


where r = d/2 

These two stresses can be superposed and illustrated as acting on an 
element at the surface of the beam, as shown in Figure 3.4. 

Mohr’s circle can now he used for this element to obtain the principal Figure 3.4 Combined bending and torsion 
stresses and maximum shear stress at this position. 

Combined pressure, axial and torsional loading 

A combination of three loads can be illustrated by considering a thin-walled cylinder, as shown 
in Figure 3.5, subjected to an internal pressure, P, an axial tensile force, F, and a torque, T. 

Figure 3.5 shows the stresses arising from each load separately acting on a surface element 
in the plane of the cylinder wall. Superposition of these three stresses are also shown on the 
element. Mohr’s circle can again be used to obtain the principal stresses and maximum shear 
stress for the element. 


Torque (T) UDL Torque 

e rf°i 3 

-Ok 
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□ PR/2t + F/A 



PR/2t + F/A 


PR/t 


Figure 3.5 Combined internal pressure, axial tensile force 
and torque 


Methodology for combined loading 

The methodology for analysing components or structures under combined loading can now be 
summarized: 

(i) Identify a 2D element at the location of interest in the component. 

(ii) Determine the stresses acting on the element arising from each individual load. 

(iii) Superpose the stresses from each individual load to obtain the combined stresses on the 
element. 

(iv) Use Mohr’s circle to determine the principal stresses and the maximum shear stress on the 
element. 


Worked example 

Combined bending and torsion — 
offset loading on a cantilever 

Figure 3.6 shows a solid circular 
cross section cantilever beam, 
of length L and diameter d, fixed 
at one end. Attached at the free 
end of the beam is a crank arm 
which allows a vertical load, P, to 
be applied at an offset distance, a, 
from the axis of the cantilever. 

Determine the maximum shear st 
on the upper surface at the fixed support Figure 3.6 Combined bending and torque 
of the cantilever beam (position A). acting on a cantilever beam 

The following load and dimensions apply: 


P = lkN 
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We consider the stresses acting on a small surface element at position A. The load gives 
rise to a bending moment and torsional moment at the cross section at position A as 
follows: 

Bending moment M = P L 
Torsional moment T = P ■ a 

These moments give rise to separate bending and shear stresses acting on the element at 
position A which can be superposed to give the total effect of the combined loading as 
shown in Figure 3.6. The stresses are: 

, . PL— 

My 9 V) PT 

Bending stress: ct b = — = — — = = 75.45 MPa 

~64 

Pa l 

„ . . , t Tr a 2 16 Pa 

Torsional shear stress t = ~r = — ~r = — jr = 22.64 MPa 

J TUp 

32 



Figure 3.7 Mohr’s circle for combined bending and torsion 

Mohr’s circle can now be drawn for the element to determine the maximum shear stress 
as shown in Figure 3.7 The coordinates of point B on the circle (u b ,t) correspond to 
the stresses on the element in the longitudinal direction, i.e. along the cantilever. Point 
E corresponds to the stresses (0 ,— t) in the transverse direction to this. The line joining 
these two points defines the diameter of the circle and, where it crosses the tr-axis, the 
centre, C.The circle can now be drawn and its radius measured to give the maximum 
shear stress as follows: 

Tmax = radius = 44 MPa 
Alternatively, by calculation: 

Given the element stresses 
<j x = 75.45 MPa 
(jy — 0 MPa 
T xy = 22.64 MPa 


The maximum shear stress, T max = radius 



= 44 MPa as before 
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Learning summary 

By the end of this section you should have learnt: 

/ the basic use of Mohr’s circle for analysing the general state of plane stress; 

/ how the effect of combined loads on a component can be analysed by considering each load as 
initially having an independent effect; 

/ how to use the principle of superposition to determine the combined effect of these loads. 


3.3 Yield criteria 


Elastic-plastic deformations 


Uniaxial tensile tests (Figure 3.8) are used to obtain some 
important material properties. Typical ‘stress— strain curves’ 
are shown in Figure 3.9. 

A linear elastic range is exhibited by a number of 
materials. In this region the strain is proportional 
to the stress (Hooke’s Law) and on removal of the 
stress, the strain returns to zero. The constant of 
proportionality is the Young’s Modulus, E, for the 
material. If the stress exceeds a certain value, i.e. the 
yield stress, a y , the strain does not return to zero on 
removal of the load. Table 3.1 gives typical values ofE 
and CT y for some common engineering materials. 

A tensile specimen, with a circular cross-section, made 
from mild steel, which is a ductile material, breaks 
with a ‘cup and cone’ mode of failure, see Figure 3.10, 
the cone angle is ~45°. 


r 



i 

Figure 3.8 Uniaxial 
tensile loading 



behaviour for mild steel 



Figure 3.10 Cup and cone failure for mild steel and Mohr’s circle for 
uniaxial tensile loading 


Material 

L’(GPa) 

a y (MPa) 

mild steel 

200 

350 

copper 

120 

310 

brass 

100 

390 

aluminum alloys 

70 

140-500 


Table 3.1 Typical values of E and u y for common engineering materials at 20°C 
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A Mohr’s circle for the loading condition, see Figure 3.10, indicates that the 45° plane is that 
on which the maximum shear stress, T, occurs. 


If a circular cross section mild steel bar is subjected 
to torsional loading (Figure 3.11), the torque 
angle response is as shown in Figure 3.12. For T 
=£ T), T ex 0, and the torque— angle behaviour is 
reversed on removal of the torque. If the torque 
is continuously increased until failure occurs, 
the fracture plane is transverse to the axis of the 
specimen (Figure 3.13). A Mohr’s circle for the 
torsion loading case, Figure 3.13, indicates that 
failure occurs on the maximum shear stress plane. 



Figure 3.11 Pure torsional 
loading for a circular 
cross section bar 




for mild steel 


Figure 3.13 Transverse failure mode exhibited by mild steel and pure 
torsional loading and corresponding Mohr’s circle 


Therefore, the results of both the tension and the torsion tests indicate that 
failure occurs on the planes that contain the maximum shear stresses. This 
behaviour is similar to that of many ‘ductile’ materials. 

Similar torsion tests on ‘brittle’ materials (e.g. cast iron) give different behaviour. 
The torque-angle response is as shown in Figure 3.14 and the fracture occurs 
on a 45° helix, as shown in Figure 3.15. 




for cast iron 


Figure 3.15 The 45° helix failure mode exhibited by mild steel under 
pure torsional loading and the corresponding Mohr’s circle 
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A Mohr’s circle for this loading condition shows that the point on the Mohr’s circle associated 
with the maximum principal stress is 90° (ccw) from point 1. Therefore, this represents a plane 
at 45° from plane 1 on the element. This maximum principal stress, ct, plane corresponds to 
the 45° helix angle of the fracture on the surface. This indicates that failure in this material has 
occurred on a plane on which the maximum principal stress exists. 

The tests and stress states used to come to the above conclusions for ‘ductile’ and ‘brittle’ failures 
are very simple and it would therefore be unwise to base failure criteria on this evidence alone. 


Yielding of ductile materials 

The topic of yield criteria is limited to the prediction of the initiation of yielding in ductile 
materials. A yield criterion closely related to the maximum shear stress (Tresca) criterion is the 
maximum shear strain energy (von Mises) criterion; these two criteria generally provide a good 
indication of yield and are widely used in elastic— plastic analysis. 


Maximum shear stress (Tresca) yield criterion 

The Tresca yield criterion states that the material will yield 
when the maximum shear stress in the material exceeds a 
limiting value. 


If a - !, <r 2 ar >d (r 3 are the three principal stresses (cr j > ct 2 > ct 3 ) 
then, Figure 3.16 shows that 



The limiting value can he related to the uniaxial yield stress, fj y , 
obtained from a uniaxial tensile test. In this case, 

O'! = CT y and a 2 = 03 = 0 

(T y — 0 CT y 

T max = 2 = T 

The T max criterion therefore states that the material will yield if: 
cti — tr 3 3= a y (o^ > ct 2 > cr 3 ) 



Maximum shear strain energy (von Mises) yield criterion 


The von Mises yield criterion states that the material will yield when the maximum shear 
strain energy (per unit volume) exceeds a limiting value. If o, , (j 2 and cr 3 are the three principal 
stresses ( 0 ^ > tr 2 > cr 3 ) then: 


shear strain energy 


volume = 12G« CTl “ ^ + ^ ^ + (a 3 “ <n) 2 } 


Again, the limiting value can be related to the uniaxial yield stress, o y , obtained from a uniaxial 
tensile test. Thus, at yield 0^ = o\, and cr 2 = cr 3 = 0: 

shear strain energy 1 

— = — - — S2& 2 1 

unit volume 12 G’ " ' 


The von Mises yield criterion can thus be expressed as follows: 

12g{(»i - ct 2) 2 + (0-2 - <Ts) 2 + (°3 - <7i) 2 } ^ 12G {2(7 ^ 

which can be reduced to the following, more common expression for the onset of yield, 
according to the von Mises yield criterion: 

(<Ti - cr 2 ) 2 + (ct 2 - ct 3 ) 2 + (cr 3 - CT1) 2 2 (Ty 2 
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Two-dimensional stress systems (i.e. a 3 = 0) 

For plotting purposes here, and ct 2 can take on any values, i.e. <J\ is not necessarily always 
greater than u 2 .The yield boundaries on the oq-CT 2 plane are shown in Figure 3.17. 



Figure 3.17 The yield boundaries, for a two-dimensional stress 
state, on the - ir 2 plane (<r 3 = 0). 


In general, the von Mises yield criterion is easier to handle analytically because it is continuous. 
This is particularly important for the calculation of incremental plastic strains, since the 
plastic strains are related to the normals to the yield surface, and at the corners of the Tresca 
yield locus there is ambiguity about the directions of the normals, whereas there is no such 
ambiguity about the von Mises yield locus. 

Three-dimensional stress systems 

The Tresca yield criterion 

CTj — (r 3 = <r y (a - ! > a 2 > cr 3 ) 
and the von Mises yield criterion 

(o-i - c Tz) 2 + (tr 2 + a 3 ) 2 + (a 3 - ct^ 2 - 2a y 2 
are not altered if a constant stress component, say a, is added to each stress component, i.e. 

(a 3 + ct) — (ct 3 — a) = CTj — a 3 = a y 

and 

((o-i + <T) ~ (ct 2 + ct)) 2 + ((ct 2 + a) - (tr 3 + a)) 2 + ((ct 3 + a) - (a t + a)) 2 
= (CT! - cj 2 ) 2 + (cj 2 - ct 3 ) 2 + (ct 3 - ctO 2 = 2ct/ 

This implies that the addition of a ‘hydrostatic stress state’, i.e. Od = ct 2 = ct 3 = ct does not 
change the shapes of the yield surfaces shown in the section on two-dimensional stress systems. 

The mean principal stress (J/, = + ct 2 + ct 3 ), which is known as the hydrostatic 

stress for a given stress state (ctj, ct 2 , ct 3 ), is the stress that causes volume change. Now, the 
independence of the yield criteria with respect to hydrostatic stress means that the three- 
dimensional yield criteria are prismatic surfaces with the axes of the prisms in each case being 
the line oq = cr 2 = u 3 .This is called the hydrostatic line in 3D stress space 

(Haigh-Westergaard stress space) and it has direction cosines |^t=, y=, — =j.The yield 

boundaries can thus move any distance in the direction = cr 2 = (T 3 .The yield surfaces for 
both the von Mises and the Tresca yield criteria therefore have a constant oblique section and 
hence a constant perpendicular cross section, whose true shape can be seen in the view along 
the line CTj = ct 2 = cr 3 . Any arbitrary stress ‘vector’ (ctj, ct 2 , ct 3 ), e.g. OB and OD, (Figure 3.18), 
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in the stress space can be decomposed into two com ponents, 
one parallel to the hydrostatic line, e.g. O A an d OC, a nd one 
perpendicular to the hydrostatic line, e.g. AB and CD. The 
oblique planes that are perpendicular to the hydrostatic line 
are called deviatoric planes and are given by equations 
of the form aq + <r 2 + a 3 = constant, each representing 
a different level of hydrostatic stress. The deviatoric plane 
with aq + ct 2 + a 3 = 0 is known as the Tt-plane. It can be 
shown that the component of (aq, a 2 , a q) pa rallel 
to th e hydrostatic line is (oq, oq, (T h ), e.g. OA and 
OC, while the component parallel to the devia toric 
planes is (oq — oq, a 2 — oq, o 3 ~ cr^.AB and CD. 

Only the latter component of stress is important in 
determining yield according to the von Mises and 
Tresca criteria. 

The view along the oq = a 2 = a 3 Hne of the von Mises and Tresca 
yield criteria is an isometric view that shows the three axes included 
at 120° intervals. This is sometimes called a view on the Tt-plane, 
on which the Tresca yield surface is a hexagon and the von Mises 
yield surface is a circle. Therefore, large principal stresses do not 
necessarily result in yield; it is the stress differences and the route to 
the final stress state that govern whether yielding will occur. 

Figure 3. 19 can be used, instead of the equations, to decide whether 
a certain stress state will be safe. Simply plot on the diagram each of 
the three principal stresses parallel to each of the three axes and see 
whether the final point lies inside the appropriate yield surface. 


Hydrostatic ai 
i°2 (oq = <t 2 = aq) 




Figure 3.18 (a) Representation of yield surfaces 
for a three-dimensional stress state and (b) the 
decomposition of the stress into hydrostatic and 
deviatoric components 


Note: 

(i) The yield condition can be examined by either using the appropriate equation, i.e. 
Tresca: oq — a 3 = o\, (oq > (t 2 > a 3 ) 
von Mises: (oq — a 2 ) 2 + (o 2 — oq) 2 + (a 3 — aq) 2 = 2 ay 2 
or by plotting principal stresses on the Tt-plane. 


(ii) All three principal stresses are important. At free surfaces the normal stress is usually zero, 
but it may be important, particularly if the other two principal stresses are of the same sign. 
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Learning summary 

By the end of this section you should have learnt: 

/ the difference between ductile and brittle failure, illustrated by the behaviour of bars subjected to 
uniaxial tension and torsion; 

/ the meaning of yield stress and proof stress, in uniaxial tension, for a material; 

/ the Tresca (maximum shear stress) yield criterion and the 2D and 3D diagrammatic representations 
of it; 

/ the von Mises (maximum shear strain energy) yield criterion and the 2D and 3D diagrammatic 
representations of it. 


3.4 Deflection of beams 


In this section we address the important topic of beam deflections. The design of engineering 
structures and components is very often dictated by the strength of the materials used and 
consequently the stresses within the structure, hut often the limiting factor is the allowable 
deflection. This is particularly important for engineering artefacts made from materials of lower 
stiffness, such as aluminium, plastics or composites, but may also be critical for high stiffness 
steel structures comprising slender flexible members. It is therefore important as part of the 
design process to be able to calculate maximum deflections in a structure in addition to the 
position at which they occur. 

This section focuses on deflections in beam structures. Following the derivation of the 
fundamental deflection equation for a beam, a flexible procedure is introduced, called 
Macaulays method, which allows deflections to be calculated at any position along a beam 
span. In particular, the method allows us to deal with different types of loading such as point 
loads, uniformly distributed loads (UDLs) and point moments, including discontinuities in 
these loads. Although not the only method for calculating deflections, as we will see later in the 
strain energy section, it is a particularly powerful and flexible method. 

Equation of the elastic line 

The section of span of a beam, shown in Figure 3.20, is under 
pure bending, i.e. there is a constant bending moment along 
this section and no shear force. Under pure bending conditions, 
the neutral surface of the beam is a circular arc with radius of 
curvature, R, as shown. The transverse deflection of the neutral 
surface is given by the coordinate y of any position along this 
surface. 

(N.B. Do not confuse this ‘y definition of deflection with the 
‘y denoting distance from the neutral axis/surface in the beam 
bending equation - as detailed in An Introduction to Mechanical 
Engineering: Part 1. 

The line denoting the neutral surface in Figure 3.20 is known 

as the elastic line or the deflection curve of the beam. 

Referring to Figure 3.20, the angle d0 defines an element AB, ds in length, of the elastic line, 
and, because d0 is small, 

ds = Rd0 



Elastic line 
(deflection curve) 

yT 

Figure 3.20 The elastic line (i.e. deflection curve) of a 
beam under pure bending 
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in other words, the curvature, 


R 


d0 

ds 


(3.1) 


For a beam bent in the elastic range, the elastic line is a very flat curve i.e. deflections are small. 
In this case, 


ds ~ dx 


where dx is the increment of x between A and B, and equation (3.1) becomes 

j_ = _de 

R dx 


(3.2) 


Note the introduction of the minus (— ) sign because 0 decreases as x increases for positive 
bending, i.e. sagging. This maintains a positive radius of curvature for positive bending. 


Also, 0 is the slope of the elastic line at position A, 



Combining equations 3.2 and 3.3 gives, 


R 


d 2 y 


dx 2 


(3.3) 


(3.4) 


And finally, combining equation (3.4) with the part of the beam bending equation relating 
radius of curvature to the bending moment (M), Youngs modulus ( E ) and second moment of 
area (7), i.e., 


M = E J_ _ M 
I R ° r R El 


gives, 


d 2 y 


(3.5) 


(3.6) 


Equation (3.6) is the differential equation of the elastic line, relating the deflection, y, to 
the applied bending moment, M, the Youngs modulus, E, and second moment of area. 7. The 
product of E and 7, i.e. El, is termed the flexural rigidity of the beam. 

Successive integration of equation (3.6) will yield the slope, dy/ dx, and the deflection, y, at all 
points along the beam. 

Equation (3.6) has been derived for the case of pure bending, i.e. constant bending moment 
along the section, and does not take into account deflections due to shear. For long slender 
beams, shear deflections can be neglected. 

Equation (3.6) may also be integrated, and solutions found, if the bending moment, M, is a 
continuous function of x. This condition applies only to beams under specific simple loading 
conditions. A complication arises where discontinuities in M exist, such as where there are 
point loads and point moments or where there is an abrupt change in distributed loading. 

Various methods have been developed to solve such problems with discontinuities. In the 
next section we will introduce and develop the procedure called Macaulay’s method, a 
versatile solution procedure that can handle most discontinuities that we are likely to come 
across. 
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Macaulay’s method (also termed the method of singularity functions) 

Named after the mathematician WH. Macaulay, this method uses a mathematical technique 
to deal with discontinuous loading. The moment expression M(x), i.e. M as a function 
of x, is replaced with the step function M<x — a>, in which a defines the points where 
discontinuities arise. The procedure will now be developed from first principles. 

Figure 3.21 shows a simply supported beam carrying three point loads W A , W B and Wq , 
along its span. Each of the three loads gives rise to a discontinuity in the bending moment. 
Knowing the applied loads, the end reactions jRj and R 2 can be calculated from equilibrium 
conditions for the whole beam. Now, considering each part of the span between the loads 
separately: 



Figure 3.21 Simply supported beam carrying 
point loads 


Span 1 

Figure 3.22(a) shows the free-body diagram of the left-hand end of the beam, 
cut at a section in span l.The unknown bending moment, M, and shear force, 

S, at this section are shown in the diagram. Considering the bending moment 
only and taking moments about the cut section, 

M- R lX = 0 

and combining with the equation of the elastic line, 
d 2 y 

EI^=-M=-R x x (3.7) 

This equation applies to span 1 only. Now consider span 2. 

Span 2 

Figure 3.22(b) shows the free-body diagram of the left-hand end of the beam, now cut at 
a section in span 2. As before, the unknown bending moment, M, and shear force, S, at this 
section are shown in the diagram. Again, considering the bending moment only and taking 
moments about the cut section, 

M + W A (x ~ a) ~ R lX = 0 

And, again, combining with the equation of the elastic line, 
d 2 y 

El -j^ = -M = -R t x + W A (x - a) (3.8) 

This equation now applies to span 2 only. 

Similar equations can be derived for spans 3 and 4 as follows. 

Span 3 

d 2 y 

El -j^ = -M= -R lX + W A (x - a) + W B (x - b) (3.9) 



Figure 3.22 Free-body diagrams 
of beam cut at different sections 


Span 4 


d 2 y 

EI-r-1 = “M = ~R t x + W A (x - a) + W B (x - b) + W c (x - c) 
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It is interesting to note that the forms of equations (3.7), (3.8), (3.9) and (3.10) are very similar, 
in that extra terms are added to take account of the point loads as we move towards the right- 
hand end of the beam. In fact, due to this similarity, equation (3.10) can be used to apply to all 
spans by rewriting it in a slightly different form: 
d 2 y 

E/ d? = ~ M = ~ RlX + w a <x ~ a> + W B <x ~ b> + W c <x - c> (3.11) 

Note the change of bracket shape in equation (3. 11). The <> brackets are termed Macaulay 
brackets, and equation (3.11) is now applicable to any point in the beam if we adopt what is 
called Macaulay s convention. 

Macaulay’s convention: ‘Whenever a bracketed term becomes negative it must be 
ignored.’ 

Adopting this convention, the general expression for M in equation (3.11) can be integrated 
twice to give the slope, dy/dx, and the deflection, y, at any point along the beam. During the 
integration, boundary conditions for the beam are used to determine the integration constants. 
It should also be noted that the bracketed terms must not be multiplied out in any general 
expression for y, dy/dx or d 2 y/ dx 2 , otherwise the Macaulay convention will be lost. 


Worked example 

Macaulay’s method example - point load on a beam 


« L « J. 

b -Ip 

\w 

rb 

*4 . ° 

S T Ry 

1 L * 


Figure 3.23 Simply supported beam carrying a 
single point load 


Figure 3.23 shows a simply supported beam carrying a point load, distance a 
from the left-hand end and distance b from the right-hand end. 

Derive an equation for the deflection of this beam at any point along its length. 


Firstly, determine the reaction force, R A , by taking moments for the whole beam about B, 


R A L - Wb = 0 


Ra 


Wb 

L 


Now, cutting the beam at a section in the extreme right-hand part of the span, distance 
x from the left-hand end, the left-hand part of the beam can be considered as a free body 
with an unknown bending moment, M, and shear force, S, on the cut section. Taking 
moments about this cut section for the free body, we have, 

M + W<x ~ a> - R a x = 0 (3.12) 

Here we have used Macaulay brackets to indicate that the expression applies anywhere 
along the beam on condition that the Macaulay convention is adhered to. The second 
term in equation 3.12, W<x — a>, is specific to a point load and is termed a ‘singularity 
function’. Different forms of singularity function appear for different types of loading as 
we will see overleaf. 
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From (3.12) and the equation of the elastic line (i.e. (3.6)), 

Wb 


d 2 y 

U dx* = 


- R a x + W<x ~ a> = - y~x + W<x - a> 


where A is an integration constant. 
Integrating again: 

Wb 


EI y=~TT ■ 


6 L*-- W ' 6 

where B is also an integration constant. 

We use the boundary conditions to solve for the two integration constants, A and B: 
y = 0atx = 0 .-. from (3.15) 

0 = 0+ 0 + 0 + 13 
B = 0 

(Note that the second term after the equal sign is zero because the quantity in the 
Macaulay bracket, <x — a>, is negative at x = 0 — ‘The Macaulay Convention’.) 

y = 0 at x = L .-. from (3.15) 

-WbL 2 W(L — a) 3 

U 6 6 

therefore: 


<x - 


a> 3 , 


- + AL 


A = 


-WbL 2 WP _ 
6 6 L 


Wb 


Substituting for A and 13 in (3.15), a general expression for y 
1 f Wbx 3 , W<x ~ a> 3 


EI[ 


6 L 


Wbx I _ 

6 V L 


N.B.The Macaulay bracketed term in equation (3.16) is ignored when x < a. 

The slope, dy/dx can be obtained from (3.14) or by differentiating (3.16). 

A special case for this problem occurs when the load is applied at the centre of the beam 
span. The deflection at point C, the load point, is then given by putting b = a and x = a 
in (3.16), giving, 

= WL 3 
y 48 El 

which is the well-known result for the central deflection of a simply supported beam 
with the load applied at centre span. 


( 3 . 15 ) 




be obtained as follows, 

)] ( 3 . 16 ) 


Other loading conditions 

Uniformly distributed load 

Consider a uniformly distributed load (UDL), gN/m, acting over part of a beam’s span as 
shown in Figure 3.24. The UDL runs from a position C, distance a from the left-hand end of 
the beam all the way to the right-hand end of the beam. A discontinuity occurs where the 
UDL commences at position C. 
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Figure 3.24 Simply supported beam carrying a 
uniformly distributed load (UDL) 


Cutting a section to the right of the discontinuity and considering the part of the beam to the 
left of this section as a free body, moments can be taken about the section, giving the following 
equilibrium equation, 


M + 


q<x~ a> 2 

2 


— R^x = 0 


(3.17) 


Combining equation (3.17) with the equation of the elastic line, 


d 2 v q <x ~ a> 2 

EI 1 ^=-M=-R 1 x + x 


As for a point load, this equation can be integrated twice to give the deflection at any position 
along the beam. Integration constants are determined from the boundary conditions as before. 
The procedure is identical to the point load case except that the singularity function, 
q <x — a> 2 

2 , is different. The Macaulay convention applies equally in this case, with the 

Macaulay brackets in the singularity function indicating whether the term is included or not 
depending on the sign of the bracketed term. It is excluded if <x — a> is negative, i.e. if x < a. 


How do we deal with a UDL which only acts over part of the 
beam span as shown in Figure 3.25(a)? 

In this case, there are two discontinuities at distances a and b 
from the left-hand end of the beam. The UDL can be extended 
to the right-hand end of the beam and an additional negative 
counterbalance UDL superimposed over the newly extended 
part, also shown in Figure 3.25(b). This gives a statically equivalent 
system to the partially extended UDL. 

Cutting a section in the newly extended part and looking at 
equilibrium of moments on the left-hand part of the beam we have, 

q <x — a> 2 q <x — b> 2 
M + ^ ^ 2 R ix = 0 

which, combining with the equation of the elastic line gives, 

d 2 y q <x — a> 2 q <x — b> 2 

EI ^I =~M= -R lX + ^ ^ 







Figure 3.25 Dealing with a UDL acting over part of 
the beam’s span - ‘counterbalancing’ the load 


As before, this equation can be integrated twice to give the deflection at any position along 
the beam. There are now two singularity functions corresponding to the discontinuities in 
UDL at positions a and b. The negative sign for the second function arises from the negative 
counterbalance UDL. 


Point bending moment 

Figure 3.26 shows both a point load, W A , acting at position a, and a point bending moment, 
M b , acting at position b on a simply supported beam. This problem has a combination of 
two loads (point load and point moment) acting on the beam and shows the versatility of 
Macaulays method. 
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Figure 3.26 Simply supported beam carrying a 
point bending moment and a point load 


Cutting a section to the right of the point bending moment discontinuity and considering 
the part of the beam to the left of this section as a free body, moments can be taken about the 
section, giving the following equilibrium equation, 

M- R x x+ W A <x ~ a> ~ M B <x - b>° = 0 (3.18) 

Note the form of the discontinuity function for the point bending moment, i.e. M B <x — b>°. 
The zero exponent is a special form that allows subsequent integration to give higher level 
exponents when deriving the expressions for slope and deflection as shown below. 

Combining equation (3.18) with the equation of the elastic line, 
d 2 v 

El— j = ~M = — R,x + W A <x - a> - M b <x - b>° 


Integrating gives, 


dy 

EI Tx = 


RjX 2 


■ <x ~ a> 2 
A 2 


— M b <x — b> + A 


The point bending moment singularity function has integrated to have an exponent of order 1 
and A is an integration constant. 


Further integration gives, 


Rjx 3 


(3.19) 


The final form of the applied bending moment singularity has an exponent 2. There are two 
integration constants, A and B, which can be solved for by using the boundary conditions. As 
before, equation (3.19) can be used to determine the deflection at any position along the beam 
on condition that Macaulays convention is used when evaluating the bracketed terms. 


Summary of singularity functions 

We have seen that Macaulay’s method can be used to find deflections at any position along a 
beam where point loads, UDLs and point moments produce discontinuities in the bending 
moment distribution. The method can also be used where there is a combination of these loads 
acting on a beam. 

When developing the bending moment equation for the beam with load discontinuities, the 
following singularity functions are used for each different type of load: 


Load 

Singularity function 

Point load [W) 

W<x — a> 

UDL ( q ) with single discontinuity 

q <x — a> 2 

2 

UDL (q) with double discontinuity 

q <x — a > 2 q <x — b> 2 

2 2 

Point moment (M B ) 

Mg <x — b>° 


It is interesting to note that in these singularity functions the exponent is 1 for a point load, 2 
for a UDL and 0 for a point moment. 
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Summary of Macaulay’s method for beam deflections 

(1) Take the origin at the left-hand end of the beam. 

(2) When necessary, extend and counterbalance uniformly distributed loading to the right- 
hand end of the beam 

(3) Obtain an expression for M in the extreme right-hand part of the beam span, i.e. beyond 
the final discontinuity 

(4) Use Macaulay brackets in relevant singularity functions for discontinuities. 

(5) Use the equation of the elastic line, i.e. El d 2 y/ dx 2 = — M. 

(6) Integrate to give d y/ dx and y. Do not multiply out bracketed terms during these integrations. 

(7) Use boundary conditions to determine integration constants. 

(8) Evaluate slope (d y! dx) and deflection (y) at required positions. Bracketed terms are ignored 
when the value within brackets is negative. 


Worked example 



Figure 3.27 Point load and UDL discontinuities acting on the same beam 

Figure 3.27(a) shows a simply supported beam carrying a point load, P = 2kN, 
at centre span and a uniformly distributed load, q = 4kN/m over part of the 
right-hand half of the span. The length of the beam is 2 m and the flexural 
rigidity of its cross section is El = 10 5 Nm 2 . 

Use Macaulay’s method to determine the deflection of the beam at centre span 
below the point load. 


Reaction forces 

Considering equilibrium of the full beam, 


Vertical forces: 


Moments about A 


and from (3.19) 


(3.20) 


p 3 qL 

Ra= 2 + 32 = 1750N 


156 



Solid mechanics 


Moment expression and integration 

The UDL is extended to the right-hand end of the beam and a counterbalancing 
negative UDL is added, as shown in Figure 3.27(b). Taking moments about a section ii 
the extreme right-hand end of the beam beyond the last discontinuity, we have, 


M = R a x — P<x - 


q<x — y> 2 q<x 


3 L. 


-> 2 


and combining with the equation of the elastic line, 

8 L 2 3L 2 

d 2 y t q<x-^> z q<x a~> 

EI-^s = ~M= -R a x + P<x ~ ~ > + 2~^ 


Integrating twice gives, 

dy _ R\X 2 P <x ~ f > 2 l<x- f > 3 

B l dx 2 + 2 6 

R a x 3 P<x~^> 3 q<x- |> 4 

EI y = ~— + — 6 — + — 24 — 


, 3L 3 

q <x - 
— 

q<*~f > 4 


24 


+ Ax + B 

(3.21) 


where A and B are integration constants. 

Boundary conditions 

Setting the boundary conditions in equation (3.21) we have, 

When x = 0 ,y = 0 B = 0 

N.B. Negative Macaulay bracketed terms are ignored, i.e. set to zero 
When x = L,y = 0 

r a l 3 mP <i lA , <i l4 

U 6 48 16.24 256.24 

Substituting values for R A , P, q and L, and solving for A gives, 

A = 922 Nm 2 


Deflection at centre span 

Equation (3.21) can now be used to evaluate the deflection at centre span. As x = L/2, 
the three Macaulay bracketed terms are zero (using Macaulay convention). Therefore, 


Ely = ~ 


- 922.1 = 630 



Statically indeterminate beams 

Macaulay’s method can also he used to solve for deflections and slopes of a statically indeterminate 
beam, i.e. a beam where the reaction forces and moments cannot be determined by the 
equations of statics alone. An example is a clamped-clamped beam subjected to a point load, as 
shown in Figure 3.28(a). 
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Drawing the free-body diagram for this beam, Figure 3.28(b), 
the end reactions comprise moments M A and M B , which restrain 
rotation, and the vertical reaction forces R A and R [s . There are 
therefore four unknowns that cannot be solved for by equilibrium 
conditions alone. They have to satisfy both equilibrium (two 
conditions) and the boundary conditions, i.e. slope and deflection 
at both ends (four conditions). Equilibrium is incorporated in 
the Macaulay bending moment equation, and, as we have seen, 
boundary conditions are also an integral part of the method. 

Taking a section, X— X, at distance x from the left-hand support, 
beyond the load discontinuity, and considering the left-hand 
part of the beam as a free body, the equation for the unknown 
moment, M, on the section is, 


fti> 


Figure 3.28 Statically indeterminate beam: clamped- 
clamped beam subjected to a point load 


d 2 y 

M = -El — y = M a + R a x ~ P<x - a> 


Integrating this expression twice gives, 


~ EI ~aZ - m aX + ~ 


P<X ; a>2 + A 


and 


Rax 3 

6 


P <x — a> 3 

6 


+ Ax + B 


(3.22) 


This expression for the deflection, y, includes four unknowns, namely M A and R A , and 
the integration constants A and 13. We can use the boundary conditions to solve for these 
unknowns. 


Boundary conditions 

When x = 0,y = 0 

B = 0 


When x = 0, dy/dx = 0 


N.B. Negative Macaulay bracketed terms are ignored, i.e. set to zero 
When x = L,y = 0 

o = + Ml _ ElLzJt 


When x = L, dy/dx = 0 


0 = M a L + 


r a l 2 

2 


P(L ~ a) 2 

2 


(3.23) 


(3.24) 


N.B. Macaulay bracketed terms are converted back to normal brackets at this stage. 
Equations (3.23) and (3.24) can be solved simultaneously to give R A and M A as follows, 


Ra 


P(L 


a) 2 (L + 2a) 

L 3 


(3.25) 


M a 


Pa(L - a) 2 
L 2 


(3.26) 
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Substituting for R A and M A from (3.25) and (3.26) into (3.22) gives the required general 
expression for deflection, y, at any position along the beam, 

_ i \ Pa(L - a) 2 x 2 P(L - af{L + 2a) x* P<x - a >3 ] 

y El [ 21? 61? 6 J 

Special case - centrally loaded beam 

For a beam loaded at centre span, i.e. a = L/2, Equations (3.25) and (3.26) give, 



And the maximum deflection at the centre of the span, under the load, is, 

1 \M a L 2 R a L ? P(0) 3 1 

Fmax - - [ 8 + ””48 6 J 

= PL 3 
192EI 

which is the well-known expression given in many textbooks. 

Comparing this result with a simply supported beam, where the central deflection is given 
by y max = PI?/48EI, it can be seen that clamping the ends of the beam results in a deflection 
which is 25% of the deflection of a simply supported beam. 


Learning summary 

By the end of this section you should have learnt: 

/ to derive the differential equation of the elastic line (i.e. deflection curve) of a beam; 

/ to solve this equation by successive integration to yield the slope, dy/dx, and the deflection, y, of a 
beam at any position along its span; 

/ to use Macaulay’s method, also called the method of singularities, to solve for beam deflections 
where there are discontinuities in the bending moment distribution arising from discontinuous 
loading; 

/ to use different singularity functions in the bending moment expression for different loading 
conditions including point loads, uniformly distributed loads and point bending moments; 

/ to use Macaulay’s method for statically indeterminate beam problems. 


3.5 Elastic-plastic deformations 

Elastic-plastic material behaviour models 

Elastic— perfectly plastic (EPP) 

In this case there is assumed to be no hardening, i.e. the yield stress, cj y , 
is assumed to remain constant at ±<r y , regardless of any previous plastic 
deformation, see Figures 3.29 and 3.30. Therefore, the yield surface doesn’t 
change in either shape or position in the principal stress-space, see Figure 
3.31. 

This is a good model for mild steel with moderate plasticity, but is also used 
very generally for materials without well-defined yield, i.e. <r y = ct 0 2 %, for 
moderate plasticity. 
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Figure 3.30 Uniaxial cyclic cr-e 
behaviour for an EPP material 
model 



Figure 3.31 Representation of uniaxial 
stress-strain behaviour (Figure 3.30) in 
principal stress space 


Isotropic hardening 

For an isotropically hardening material, 
any plastic deformation causes an increase 
in the yield stress, and the yield surface 
continuously expands isotropically (i.e. by 
the same amount in all directions) with 
loading, reverse loading and reloading 
involving plastic deformation, as 
indicated in Figures 3.32 and 3. 33. Thus, 
cyclic tension— compression loading, i.e. 
varying between equal and opposite fixed 
values of applied stress, which initially 
gives yield (in tension) will quickly 
‘shake down’ to elastic behaviour, since 
the subsequent compressive stress will 
not be large enough to cause yield in 
compression. 



Figure 3.32 Uniaxial cyclic 
cr-E behaviour for an isotropic 
material model 



isotropic stress-strain behaviour 
in principal stress space 


Kinematic hardening (assuming linear plasticity for simplicity) 

For a kinematically hardening material, the yield stress in compression following yield in 
tension is reduced due to the tensile yield, as indicated in Figures 3.34 and 3.35. 



Figure 3.34 Uniaxial cyclic <t-e behaviour 
for a kinematically hardening material 



Figure 3.35 Representation of kinematic 
stress-strain behaviour in principal 
stress space 
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Kinematic hardening assumes that the yield range remains constant at a value of 2a y , so that 
the subsequent yield in compression will be the highest tensile stress with plasticity, — 2a y , as 
shown in Figure 3.34. This can be represented in principal stress space as a translation of the 
yield surface without changing shape or size, as shown in Figure 3.35. In this case, cycling 
between tensile and compressive stresses of equal and opposite values, which initially gives yield 
(in tension) will also give yield due to the subsequent compressive stress since the compressive 
yield stress will have been reduced. Thus, the kinematic hardening model predicts that constant 
alternating plastic strains will occur after the first loading cycle. 

In the following analyses related to the elastic— plastic deformation of components (e.g. beams 
in bending, and torsion of shafts) only EPP material behaviour models will be considered. 

Bending of a beam with rectangular cross section, assuming 
elastic— perfectly plastic material behaviour model 

The following examples will illustrate the basic approach. 


Worked example 

A rectangular section beam (100 mm wide X 200 mm deep) is made from an 
elastic-perfectly plastic material with E = 200GN/m 2 and a y = 250N/mm 2 . 
Calculate the radius of curvature and the bending stress distribution when a 
pure moment of 190 kNm is applied and after the moment is removed. 

Check whether or not yield has occurred. 

The applied moment at first yield, M y , is 
that which causes the maximum elastic 
bending stress to become equal to the yield 
stress. Since the maximum elastic stress 
occurs at the top and bottom (extreme) 
fibres of the cross section, first yield occurs 
when a = ±a } , at the positions y = 

±100 mm. Thus, M y is given by 

(100 X 200 3 ) 4 


I 


; E = 200GN/m 2 


O - 1 250 N/mm 2 X 


M = — =- 


12 


Figure 3.36 The EPP material 
behaviour model 


100 n 


i.e. M y = 166.7 X 10 6 Nmm = 166.7kNm 

The applied moment (190 kNm) exceeds M y and therefore yielding will occur. 




Figure 3.37 A section of the beam in bending, the beam cross section and the variation of 
stress through the beam cross section 
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Assume that yielding occurs at y 3= a and y =£ —a. 
Variation of stress with y: 




For -a < y < a, ct = +250 -N/mm 2 
For a < y < |, a = +250 N/mm 2 

for - 1 < y < -a, a = 250 N/mm 2 


Moment equilibrium: 


I » — + d/2 

m =Ja yivdA) = J y= _ d/2 <ry(bdy) 

n = d/2 

M = 2 J _ q aybdy 

M = 2 {jy= 0 ( ~ 250 l) ybdy + Iy= + +T 25 °y bd y\ 

-2x-wbi:i 


which gives 


Compatibility requirement: 

V 

E = R 

At y = +84.85 mm, ct = 250 N/mm 2 and since this point is within the elastic range, use 
of the linear elastic stress— strain equation gives: 

= a = 250 = 84.85 = (T\ 

E E 200 X 10 3 R [rI 

R = 67.9 X 10 3 nun = 67.9 m 

On unloading, assume that the stress change which occurs is elastic, but check the 
resulting solution to establish whether or not reverse yielding has occurred in order to 
validate this assumption. 
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Assuming elastic unloading, the maximum stress change will occur at y = 
given by 


, My 

A <*=-/ = 


-190 X 10 6 Nmm X (+100 mm) 
1 100 X 200 3 j TTiTTi 4 


= —285 N/mm 2 

Aa^ lax = -285 N/mm 2 at y = + 1 
with a corresponding stress change of +285 N/mm 2 at y = — ^ 

By adding the stresses on the cross section when loaded to the stress changes that occur 
on unloading, the residual stresses can be obtained, as shown in Figure 3.38. 



Figure 3.38 The stress distribution on loading, the stress change on unloading and the 
residual stress distribution 

It is clear that the residual stress is well below the yield stress of ±250 N/mm 2 , so reverse 
yielding does not occur. 

There will also be a residual curvature corresponding to the residual stresses. At y = 

84.85 mm, there has been no plastic deformation. Therefore, the residual strain is given by 


e (residual) = 


8.2 


84.85 

R 


' 200 X 10 3 
R= 2.070 X 10 6 mm 

i.e. the residual radius of curvature is 2070 m. 

On releasing the moment, the radius of curvature changes from 67.9 m to 2070 m. This 
change of curvature is called ‘spring back’ and is particularly important when bending 
bars of metal to specified radii of curvature. 


Torsion of circular shafts, assuming an elastic-perfectly plastic 
material behaviour model 

Figure 3.39 shows a circular shaft subjected to pure torque, T. This results in a twist, 

0, and the rotation of an initially axial line by an angle y.This causes the distortion of an 
element of material on the surface of the shaft, as indicated in Figure 3.39. 
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Figure 3.39 A circular shaft subjected to a pure torque, T 


Figure 3.40 Distortion of the initially rectangular element and 
t— y material behaviour 


Compatibility requirement: 

Assuming plane sections remain plane and radii remain straight, then at r = R 

i x (?,=*) = *e => y r = R = E f 


and hence at a general radial position, r, 



(3.27) 


i.e. the shear strain varies linearly with radius, r. 

Since the highest strains occur at the maximum radius r = R, then yield will start at the outer 
surface and spread inwards as the torque is increased, as indicated in Figure 3.41. 



Equilibrium (moments about the axis): 


Torque 

arm 



(3.28) 


As with the case of beam bending, residual stresses can be obtained by determining (elastically) 
the change of stress that occurs when the load is removed. Remember that it is necessary to 
check that the residual stresses are not great enough to cause reverse yield. For elastic unloading, 
the following elastic torsion formulae can be used: 

T _ t _ G6 

J r l 
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The relationship between t— y and the more commonly available uniaxial a — s data depends 
on the yield criterion. The Mohr’s circle for this case, Figure 3.42, indicates that 07 = — cr 3 = 
t and a 2 = 0. Hence, for a material obeying the von Mises criterion. 


Vvm = - ° 2) 2 + ( 0-2 - 0- 3 ) 2 + (ff 3 - CT !) 2 

= (Ty at yield 

= ^V(2t y) 2 + (t y) 2 + (Ty) 2 = /3TJ, = a y at yield 

(Ty 

=> Ty = = 0.577O-J, 

And for a material obeying the Tresca yield criterion 

CT y = CTi - CT 3 = Ty — (—Ty) = 2T y = (T„ n 

a y 

Ty = y = 0.5a y 



Figure 3.42 Mohr’s circle 


Worked example 

A solid circular shaft of diameter 50 mm and length 1000 mm is subjected to 
a torque, T. The shaft is made from an elastic— perfectly plastic material with 
y = 100N/mm 2 , G = 70GN/m 2 . Determine the magnitude of the torque 


required to cause yielding to occur at ; 
angle of twist. 


radius of 15 mm (and greater) and the 



Figure 3.43 

Equilibrium (moments about the axis) 


r =25 

T = J _ () rX t X 2Ttrdr 
Torque arm X force stress X area 
T= ( 5 2Tt^r 3 dr+ 200'itr 2 dr 


L3] 15 j 

?nn J 15 4 1 25 3 15^1 

- 200 'ir[ 15 x4 + 3 - 3 J 

T= 3.096 X 10 6 Nmm = 3.096 kNm 
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Relationship between T y and y y : at the outermost elastic point, t = T y and y = y } , and the 

T y 

elastic relation G = — is applicable. It should be noted that outside this outermost elastic 
point, ^ y 

i.e. r = 15 mm, the strain, which will be larger, will be elastic— plastic and consequently 
will not be governed by t = Gy, which is the elastic relation. Nonetheless, at r = 15 mm, 


^=r = i 


100 


‘ G 70000 
by invoking the compatibility requirement, 

r y 0 = y y l 

and hence 

„ V 


1.4286 X 10“ 3 rad 


_ 1,4286 X 10~ 3 X 1000 n 
15 mm 


[360 deg] 
\ 2tt rad / 


Worked example 

A stepped circular shaft is built 
in at both ends; the two shaft 
segments are the same length, l. If 
the stepped shaft is made from an 
elastic-perfectly plastic material 
with T y = 100N/mm 2 , find the 
magnitude of the torque applied at 
the step to just cause yield in the 
smaller shaft. 



u 

l 2 






\ 

A t 




) T 



<t>40mm / T 


Stress-strain behaviour 



Equilibrium 




© 


- 0 ---) 


T = Tj + T 2 
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The small shaft just reaches yield, so the linear elastic torsion equation can he used for 
that shaft, i.e. 

T _ t _ G6 

J r l 

T 2 , _ lOON/mm 2 


=» T 2 = 157.1 X 1 0 3 Nmm = 157.1 Nm 

For the large shaft, 0 and l are the same as the small shaft, and since y = (elastic or 
plastic), then 

7llr=10mm = 1fe!r= lOmni 

Therefore, shaft 1 must also be elastic for r < 10 mm and will be plastic for r > 10 n 
Thus, for the large shaft, 

for r< 10 mm, t = = 10r 

for r> 10mm, t = t v = 100N/mm 2 






Equilibrium 


Torque ar 


r X lOr X 2irrd r + j 
stress X area 
i X force 


r X 100 X 2'nrdr 
V^stress X area 
Torque arm X force 


T t = 20 t7 r 3 dr + 200tt r 2 dr 




200Trfi: 


. 20 „ x 15! + 200 ,[2Qi_l|!] 

Tj = 1623 X 1 0 3 Nmm = 1623 Nm 
total torque = T = T t + T 2 

= (1623 + 157.1) Nm 
T = 1780 Nm 
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Learning summary 

By the end of this section you should have learnt: 

/ the shapes of uniaxial stress-strain curves and the elastic-perfectly plastic approximation to 
uniaxial stress-strain curves; 

/ the kinematic and isotropic material behaviour models used to represent cyclic loading behaviour; 

/ the elastic-plastic bending of beams and the need to use equilibrium, compatability and rr-e 
behaviour to solve these types of problems; 

/ the elastic-plastic torsion of shafts and the need to use equilibrium, compatability and T-y 
behaviour to solve these types of problems; 

/ to determine residual deformations and residual stresses. 


3.6 Elastic instability 

Introduction 

For many structural problems, it is reasonable to assume that the system is in stable equilibrium. 
However, not all structural arrangements are stable. For example, consider a 1 metre-long stick 
with the cross-sectional area of a pencil. If this stick were stood on its end, the axial stress would 
be small, but the stick could easily topple over sideways. This simple example demonstrates that 
in some configurations, stability considerations can be primary. 

This section is concerned with the stability of struts. Struts are compression members with 
cross-sectional dimensions that are small compared to the length, i.e. they are slender. If a 
circular rod of, say, 5 mm diameter, which has its ends machined flat and perpendicular to the 
axis, were made 10 mm long to act as a column, there would not be a problem of instability 
and it could carry a considerable load. However, if the same rod were made a metre long, the 
rod would become laterally unstable at a much smaller applied force and could collapse. 

Buckling also occurs in many other situations with compressive forces. Examples include 
thin sheets, which have no problem carrying tensile loads, narrow beams unbraced laterally, 
and vacuum tanks, as well as submarine hulls. Thin- walled tubes can wrinkle like paper when 
subjected to torque. 


Buckling phenomenon 

Consider the response of a marble when subjected to disturbances from an initial equilibrium 
position on different types of surfaces, as shown in Figure 3.48. If the surface is concave, the 
marble will return to its original equilibrium position and the marble is said to be in a stable 
equilibrium position. If the surface is flat the marble will move to another equilibrium position 
and the marble is said to be in a neutral equilibrium position. Finally, if the surface is convex, 
the marble will roll off uncontrollably in an unstable fashion and the marble is said to be in an 
unstable equilibrium position. 

O O 

Stable equilibrium position Neutral equilibrium position Unstable equilibrium position 

Figure 3.48 Equilibrium states for a marble on various surfaces 
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This analogy is useful for understanding the energy approach to buckling 
problems. Every deformed structure has a potential energy associated with it, 
which depends on the strain energy stored in the structure and the work done 
by the external loads. A concave potential energy function at equilibrium gives 
a stable equilibrium while a convex potential energy function gives unstable 
equilibrium. 

Alternatively, buckling problems may be treated as bifurcation problems. 
Referring to Figure 3.49(a), it is clear that the tensile force will tend to restore 
the bar to equilibrium if there is a slight displacement to the right. However, 
the same bar under the action of a compressive force, Figure 3.49(b), will 
continue to fall when subjected to a slight displacement. This illustrates unstable 
equilibrium. 

Figures 3.50(a) and (b) illustrate a slightly more complicated example of the 
same phenomenon. The vertical bar is supported horizontally by two springs 
of stiffness, k. If the bar of length L, is displaced a small amount, x, horizontally, 
there is a displacing moment of Px about O and a restoring moment 2 kxL. 
Hence we get Px < 2 kxL for stable equilibrium and Px > 2 kxL for unstable 
equilibrium. 



Figure 3.49 Examples of stable and 
unstable equilibrium 


i 


The critical condition occurs when 


Px = 2 kxL 


P c = 2kL 

where P c is termed the critical load between stable and unstable equilibrium. 

P P 


© 


Figure 3.50 Axially loaded rigid bar with 
transverse springs 

Figure 3.51 shows a rigid bar subjected to a compressive axial load with a torsional 
spring at its base; a free-body diagram of the problem is also shown. Taking 
moments about the point O, gives. 

PL sin 0 = K 0 Q 

PL = _0_ 

K 6 sin0 

PL 

Figure 3.52 shows a graph of tt - versus 0. There is a stable region for low loads 

and an unstable region for high loads. Below point A, the bar will return to its 
equilibrium position if rotated slightly to either the right or the left. Once the load 





P 
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exceeds the value at point A, then any disturbance will cause the bar to rotate along 
either the right branch or the left branch of the bifurcation curve. Point A is called 
the bifurcation point, at which there are three possible solutions. The associated load 
at point A is called the critical (buckling) load. 0 = 0° is the trivial solution. Only 
non-trivial solutions are generally of interest, i.e. for 0 # 0°. 



Figure 3.52 Variation of PUK a with 0, 
indicating stable and unstable regions 


Ideal struts 

Ideal struts are assumed to be initially perfectly straight and of uniform section, and subjected 
to purely axial loading. Expressions will be developed relating the critical buclding load to 
the applied load, the material properties and the member dimensions, for different support 
conditions of the struts. At a critical load, members that are circular or tubular in cross section 
could buckle sideways in any direction. Often, compression members do not have equal 
flexural rigidity, El, in all directions and there will he one axis about which the flexural rigidity 
is a minimum, depending on the dimensions. The member will therefore buckle about this axis 
and the /-value (second moment of area) referred to here is assumed to be the minimum value, 
based on the nominal dimensions of the member. 


Case 1: Hinged-hinged 

Consider an initially straight strut with its ends free to rotate around 
frictionless pins, as shown in Figure 3.53, which will be referred to as a 
hinged-hinged case. The dashed line represents the initially straight strut. 
The strut is now considered to be perturbed, from its initially straight 
position, as shown in Figure 3.53. This perturbation is equivalent to the 
movement of the marble in Figure 3.48. 

The bending moment, M, depends on the deflection, y, and is hence a 
function of position, x. 

The deflection, y, is related to the moment M, by the relationship 


and 


d 2 y 

dx“ 


= -M 


p _l 

— x 




L 

Figure 3.53 A hinged-hinged strut 



Figure 3.54 Free-body diagram for 
the left-hand portion of the strut 


M= Py 


d 2 y p 
dx* + Tl y -° 
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or 


where 


Hence, 


fy 

dx 2 


cry = 0 


a 


_P 

El 


y = A sin ax + B cos ax 


In order to determine A and B, we need two boundary conditions, i.e. at x = 0, y = 0 and at 
x = l, y = 0. 

Therefore, B = 0 and A sin(a/) = 0 

The condition A = 0 results in a trivial solution, i.e. y = 0, which is the case for an undeflected 
strut. Hence, the non-trivial solution is sin (a/) = 0, which gives od = n tt, where n = 0, 1, 2, . . . 

a 2 / 2 = « 2 t t 2 



n = 0 gives another trivial solution, i.e. P = 0, 
n = 1 gives 

p c = ^ 

This is called the Euler buckling (or crippling) load, P c ; it is the lowest load at which 
buclding can occur (Euler solved this problem in 1757). 

For n = 2 we have 

p _ 477 2 _EI 

f 

and this corresponds to a different deflected (budding) shape of the strut. 

For n = 1 , y = y max at x = ^ and therefore A = y max and the deflected shape of the strut is 
given by the following expression: 

... . / rmx \ 

y = ymaxSin(ax) = y ma xSin^— j 


The magnitude of y max cannot be determined from the boundary conditions and it can 
become arbitrarily large, leading to elastic instability of the structure. The first three budding 
mode shapes are shown in Figure 3.55. If buclding mode I is prevented from occurring by 
installing a restraint (support), then the column would buckle at the next highest mode at a 
critical load value that is higher than for the lower mode. The inflexion points, 7, for each 

d 2 y 

deflection curve have zero deflection. Recalling that the curvature at an inflexion point 

is zero indicates that the internal moment at these points is zero. If roller supports are put at 
any other point than point 7, the boundary value problem must be solved for new eigenvalues 
(budding loads) and eigenvectors (mode shapes). 
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Where 


where 



Figure 3.56 A fixed-free strut 


“ ~ M 

M= Py 
dry 

EI d^ + p y = ° 

d 2 y 

-^ + « 2 y = o 


a 2 


_P 

El 
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The solution to this differential equation is: 

y = A sin ax + B cos ax 
At x = 0, y = 0, and therefore B = 0. 
dy 

At x = /, -j^ = 0, and therefore A a cos (a/) = 0 

So far, the mathematical solution is practically identical to that of a free— free strut. However, the 
boundary conditions are different, i.e. in this case, A = 0 or a = 0 or cos (a/) = 0, leading to 
trivial solutions (as before). The non-trivial solution results from taking cos (a l) = 0, which 
implies that a l = 

i.e. ~~ l 2 = where n — 1,3,... 

The smallest, non-trivial, value of P occurs with n = 1, i.e. 

c 4 1 2 

By comparison with the hinged— hinged case, it can be seen that the solution is the same except 
that T is replaced by ‘ 21 ’, i.e. the fixed— free case can be treated as the hinged— hinged case for a 
strut with an equivalent length of 21. 


Case 3: Fixed-fixed 

The solution procedure is the same as that for cases 1 and 2, leading to: 
4 tt 2 EI 
l 2 

The fixed— fixed case shows a significant increase in the buckling 
capacity relative to the hinged-hinged case. 



Figure 3.57 A fixed-fixed strut 


Case 4: Fixed-hinged 

p = 2.045tt 2 E7 ^ 2tt 2 EJ | 

This case differs from the previous examples in that a transverse 
R, is necessary to create this mode of deformation. 

d 2 v Figure 3.58 A fixed-hinged strut 

m i = - M 

but M + Rx = Py 

&y,P_ = _R 

dx 2 EI y EI X 
d 2 y 2 Rx 

1 e ^ + a,= El 

where a 2 = 

The solution to this type of differential equation consists of two parts, a homogenous solution 
and a particular integral. The homogeneous solution is given by 

y = A sin ax + B cos ax 
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The particular integral for such second order differential equations is generally obtained by taking 
P- 1 =y=C-f{x) 


where 


d 2 y 

dx 2 


+ a 2 y = const -/(x) 


And substituting for y in the differential equation gives the solution for C. In this particular 
case,/(x) = x, so that 


a 2 C-f(x) = §/(x) 


a 2 El P 

nr R a \ R 

Rl =Y = -j5 -ft*) = AS X 


Hence, the complete solution is: 


y = A sin ax + B cos ax + —x 


Homogeneous solution 


Particular integral 


There are three unknowns this time, A, B and R. Therefore, we need three boundary 
conditions, i.e. at x = 0, y = 0 and so B = 0, 

at x = l, y = 0, and so, 0 = A sin al + ~ l 


Therefore 


A = EL _ 

Psinal 


Also, at x = 1, 


dy 

i = °> 


Therefore 

The smallest non-trivial 


Rla 


cos a/ + vr = 0 


Psinal 

tana / = al 
this equation is 

al = 4.493(~1.43tt) 

a 2 / 2 = -^jl 2 = 1.43 2 tt 2 

2,045tt 2 El 

l 2 


Example problem 

A steel framework with four vertical legs of solid rectangular cross section (20 mm X 30 mm) 
each of length 400 mm is to be used to support a platform carrying equipment. Determine the 
maximum weight that the framework can support using the following assumptions: 

(a) hinged-hinged connections at both ends 

(b) fixed-fixed connections at both ends 

(c) hinged— fixed connections. 

Assume E = 200 GPa and a yield stress of 500 MPa for the steel. 
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Case 5: Hinged-hinged with initial curvature 

Figure 3.59 shows a hinged— hinged strut with initial curvature, y 0 , and the deflected shape, y. 


Deflected 



Figure 3.59 A hinged-hinged strut with initial curvature 

Suppose that y 0 = esinf-yr-j describes the initial shape. 


From the free-body diagram (Figure 3.60), 

^,d 2 (y - y 0 ) 


= -Py 


Where 


d^y 

dx 2 


+ a 2 y 


a 




Figure 3.60 Free-body diagram 
for a hinged-hinged strut with 
initial curvature 


In this case there is a particular integral, as well as a homogeneous function. 

Assume the particular integral is obtained by taking y = Csin^^p-j, where C is 
constant, and 


substitute this into the differential equation: 



unknown 


Thus the overall solution, i.e. the homogeneous function and the particular integral, is 


y = A sin(ax) + B cos(ax) — y 


The A and B values are obtained by substituting the boundary conditions 
At x = 0, y = 0 


At x = l, y = 0 

0 = A sin (od), i.e. A = 0 

TT 2 . TTX 



TT 2 

When -p = a 2 , y = oo (except at x = 0 or /) 

TT 2 P 

1,e ‘ f ~ El 

t^EI 
1 f 
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i.e. the initially curved 


buckles when 


P(= Pc) = - 


and 



Therefore, the assumed initial shape has tittle effect on the solution, i.e. 


P c 


TT El 

f 


and the deflected shape, y, would be practically the same. 


Eccentrically loaded struts 



Figure 3.61 An eccentrically loaded strut 


Consider a long, slender member subjected to a slightly eccentric compressive load as shown 
in Figure 3.61; the axial load is P and the eccentricity of the load is e.This form of loading will 
cause the member to curve so that at a distance x from the left-hand end, the eccentricity of 
the load becomes (e + y).The deflection, y, is related to the moment, M, by the relationship 


d 2 y 

FI — - = - 
dx 2 


and, x from the left-hand-end, 


M= P(e + y) 


d 2 y 

EI ^2 = - p (e + y) 



Figure 3.62 Free-body 
diagram for an eccentrically 
loaded strut 


d2 y , 2 2 

d? + a y = -a “ e 


where 


a 2 


_P 

El 


The solution to this differential equation consists of two parts, a homogeneous solution and a 
particular integral. The homogeneous solution is given by 

y = A sin ax + B cos ax 

In general the particular integral is obtained by taking 

P-/=y = C'/(x) 

where 

d 2 y 

+ a ~Y = const. /(x) 
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and then substituting into the differential equation for y and solving for C. In this case, the 
particular integral is taken to be C, since/(x) is 1 and hence C = -e.Thus, the total solution to 
the differential equation is: 

y = A sin ax + B cos ax — e 

In order to determine A and B, we use two boundary conditions. 

7 dy 

At x = 0, y = 0 and at x = y y; = 0 

Thus, 0 = B — e, i.e. B = e, and 0 = Aa cos^^pj — Ba siny 



So, 


y = etan^yjsin (ax) + e cos (ax) — e 
y = e(tan(y)sin(ax) + cos (ax) - l) 


. I 


The maximum deflection, y , will occur at x = y , so that, 

y 


fWf) + cos(f) - l) 




- 1 


Y = 



1 


y - ) - l) 


Therefore, when y = y, then y = oo 
However, a 2 = so that when y = y, y = oo 

The value of P which causes y = oo is the buckling load (or the 

Euler crippling load), P c , so y = oo, when 


P= P c 


tt 2 El 
l 2 


It is important to note that P c is independent of e, so that the 
buckling load is the same no matter how large or small the 
eccentricity, so even very small eccentricities will give the same 
buckling load. 

Figure 3.63 shows the relationship between transverse deflection 

A P 

y and the applied load ratio y for a range of eccentricity values, e. 



Figure 3.63 Variation of y with P/P c for a range 
of eccentricity values 
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General formula: P c = 

I4r 


Description Schematic Critical buckling load, P c Effective length, L eff 



The effective length, L eff , is a measure of how much longer (and thus more unstable) a given 
strut configuration appears to be in terms of critical buckling load, relative to the hinged- 
hinged case. Thus, the fixed— fixed case, for example, has a shorter effective length because it is 
more stable and thus appears to be shorter with respect to buckling. 


Beams with both axial and transverse loads 

A beam which is acted on by a compressive axial force in addition 
to transversely applied loads is referred to as a beam— column. 

In this section, the buckling load for such a member, under the 
action of a point load at the mid span, is derived. 

/ W d 2 y 

ForO ^x^^,M=Py + yx and M = ~EI 

d 2 y iy x p 

d? + a 2 y=--^2 where a= Ei 



Figure 3.64 A beam-column with a central 
transverse load 


The solution to this differential equation, including using both the homogeneous 
function and the particular integral, is 

y = A sin (ax) + B cos (ax) - yy 



The constants A and B are obtained by using the boundary condition: Figure 3.65 A tree-body diagram 


at x = 0, y = 0 

1 dy 

.'. B = 0 and at x = = 0 

2 dx 


for the left-hand section of the 
beam-column 


0 = Aacos^j 


W 
2 P 
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giving 


Therefore, the solution is 


j y( sin(ax) 
= 2P\ 


The maximum deflection, y max , occurs 


ymax = 



1 

2 


W ( (od)_od) 
2Pa \ tan l 2) 2) 


The maximum absolute bending moment occurs at the mid-span: 


These expressions for y, y max 


M max = 
and M n 


become infinite when — 


od _ n l_ l_ _ rm 
2 - i El 2 ~ 2 


multiple of "z - , i.e. when 


where n is an integer. 
When n = 1, 


P c = 


Ti 2 EI 

l 2 


i.e. the critical buckling load is unaffected by the presence of the transverse load, W. 
The expression for y max can he rewritten as 


y 


W I (al\ _ dl\ 
2EIa? l tan \ 2 / 21 



Since ~^jErj would be the deflection without P, it is clear that the presence of the compressive 
force has magnified the deflection by a factor of 3^tan^pj — (^))/(^) which is 

approximately equal to 1.33 for for example. By contrast, a tensile axial load would 

have the effect of reducing the transverse deflections. 

It also follows that the bending moments in slender members can be substantially increased by 
the presence of compressive axial forces. 
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Some important notes 

(1) In contrast to the classical cases considered here, actual compression members are seldom 
truly pinned or completely fixed against rotation at the ends. Because of this uncertainty 
regarding the fixity of the ends, struts or columns are often assumed to be pin-ended. This 
procedure is conservative. 

(2) The above equations are not applicable in the inelastic range, i.e. for ct > ay and must be 
modified. 

(3) The critical load formulae for struts or columns are remarkable in that they do not contain 
any strength property of the material and yet they determine the load-carrying capacity 
of the member. The only material property required is the elastic modulus, E, which is a 
measure of the stiffness of the strut. 


Compressive loading of rods 







Figure 3.66 Tensile test specimen and elastic 
perfectly plastic stress-strain behaviour 


If we assume that the rod loading is perfectly axial, and the material can be represented by an 
elastic— perfectly plastic stress-strain curve, then the plastic collapse failure would occur 

in compression if ct^ = yy~j reaches — a y before the buckling load is reached. Now 


P c = 

and defining the second moment of are; 


tt 2 EI 
l 2 ] 


where k is the radius of gyration, gives 

_ T! 2 EAk 2 

l 2 

and 

= Ee = V 2 Ek 2 = TT 2 E 
U A p (l/k ) 2 

and l/k is the slenderness ratio. 

tt 2 B 

Therefore, buckling will occur if a = • , y 
[l/k) 

whereas plastic collapse will occur if ct > ay 



Figure 3.67 Plot of ct versus l/k indicating 
the buckling and plastic collapse regions 
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Assuming that the load is to be applied eccentrically as shown in Figure 3.68, this can be 
represented diagrammatically as shown in Figure 3.69. 



Figure 3.68 Eccentrically loaded bar 



Figure 3.69 Effect of ec/k 2 on the buckling 
characteristics of an eccentrically loaded strut 


We find that: 


<T, = 7 1 + 


P(e + y)c \ 
Ak 2 I 


(e + y)c\ 


Recall y = 


for an eccentrically loaded strut 


giving 


P 

A 


( T c 


1 + # sec (f) 


It should be noted that the latter equation is a non-linear equation in P since a also contains 
P.The plotted results were obtained by substituting in the material yield stress for a c and then 
solving for an allowable load P using known values of eccentricity e, thus: 


CT r=-|( 1+ # sec (/ll)) 


It should also be noted that the buckling load for an eccentrically loaded strut is the same as 
that for a hinged-hinged ideal strut. Therefore, the plotted results show that yield failure will 
always occur first except for small eccentricities at high slenderness ratios where the yield curve 
approaches the Euler buckling curve. 

181 


An Introduction to Mechanical Engineering: Part 2 




Worked example 

A hoist is constructed using two wooden bars ( E = 12.4 GPa) as shown 
in Figure 3.70 and Figure 3.71. The allowable normal stress is 13.8 MPa. 
Determine the maximum permissible weight W that can be lifted using the 
hoist for the two cases: (a) L = 1.2 m; (b) L = 1.5 m. 



50 mm 


100 mm 


Section A-A 


50 mm 


120 mm 


Section B-B 


Figure 3.71 



By inspection, we see that member BC will be in compression. Thus, to determine the 
maximum value of W, we need to consider buckling failure of member BC and strength 
failure of both members due to the axial stress exceeding the given allowable stress. The 
internal forces in members BC and CD can be found in terms of W. The axial stresses in 
the members are then compared with the given allowable values to determine one set of 
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limits on W. To determine the critical buckling load, the smaller second moment of area 
about the perpendicular axes through the section centroid should be used and the upper 
limit on W to prevent buckling failure can be found. The maximum value of W that 
satisfies the strength and buckling criteria can now be determined. 

The free-body diagram for the pulley is shown in Figure 3.72, with F BC drawn compressive 
and Fm as tensile. The internal axial forces are found from 


J2F y = 0 =► F CD sin 30 = 2W => F CD = AW 

XX = 0 F bc = F cd c os 30 F bc = 3.464W 

The cross-sectional areas for the two members are A BC = 5000 mm 2 and A cn = 

6000 mm 2 . Thus the axial stresses in terms of W can be found and these must be less than 
13.8 MPa, giving two limits on W: 



4 W 
6000 


=£ 13.8 MPa 


W 20.7 kN 



3.464 W 
5000 


s= 13.8 MPa 


IT=s= 17.9kN 


For cross section A- A, we note that 

4iin = i X 100 X 50 3 = 104 1 666.7 mm 4 
Thus, for length (a), 

Tt 2 F7 tt 2 X 12.4 X 10 3 X 1041 667 

TF Tax? " 56658 - 8 N 


Thus, a second limit on F BC is F BC < 88 529 N, thus 3.464 IT 

< 88 529 N or W < 25.6 kN. Since this is greater than the value that causes yielding, 
the yielding value will occur first, so that the maximum permissible value of W for 

L = 1.2 mis 17.9 kN. 

Then, for length (b), 

tt 2 FJ tt 2 X 12.4 X 10 3 X 104 1 667 _ „ „ T 

P ». = TT = 1500 3 = 88 529,4 N 

Thus, a second limit on F BC is F BC < 56 658.8N, thus 3.464 IT 

< 56 658.8 N or W < 16.36 kN. Since this is less than the value that causes yielding, the 
buckling value becomes limiting, so that the maximum permissible value of W for 

L = 1.5 m is 16.4kN. 

Note: In case (a) the design was governed by material strength whereas in case (b) 
buckling governed the design. If there were several bars of different lengths and cross 
sectional dimensions, it would save significant time to calculate the slenderness ratio that 
separates long columns from short columns, i.e. the buckling failure regime from the 

tt 2 F 

yielding failure regime. Using cr CT = 13.8 MPa in ct = gives L/k = 94.2 as the 

threshold ratio separating 

long columns from short columns. For case (a) the slenderness ratio is 83.1, so that 
it is classified as a short column and material strength governs W nax . For case (b) the 
slenderness ratio is 103.9 so that buckling governs W max . 
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Learning summary 

By the end of this section you should have learnt: 

/ Macaulay’s method for determining beam deflection in situations with axial loading; 

/ the meanings of and the differences between stable, unstable and neutral equilibria; 

/ how to determine the buckling loads for ideal struts; 

/ the effects of eccentric loading, initial curvature and transverse loading on the buckling loads; 

/ how to include the interaction of yield behaviour with buckling and how to represent this interaction 
graphically. 


3.7 Shear stresses in beams 

Whereas bending stresses in beams arising from transverse loading are important, transverse 
(i.e. through-thickness) shear stresses due to these same loads also exist. For long slender 
beams, the shear stresses can generally be neglected, and it is only necessary to do a bending 
calculation for the beam. However, as the beam-span-to-depth ratio reduces, i.e. if the beam 
is shorter and thicker, shear stresses become more important and should be calculated in any 
design evaluation. This can be particularly important for laminated beams, e.g. plywood or 
composite beams, where the transverse shear can cause failure between individual layers (plies) 
making up the beam. 

In this section we will derive a general formula for calculating the shear stress distribution 
through the thickness of a beam. We will then introduce the concept of shear centre which 
is the point through which the resultant of the shear stresses always act. The shear centre 
becomes important for beam sections that have low torsional rigidity, i.e. can twist easily, such 
as thin-walled sections. For such beams, if the resultant of the applied transverse loads does not 
act through the shear centre, it can cause twisting of the beam, i.e. there is a bending— twisting 
interaction in the system. The designer should avoid this situation if possible or, at least, evaluate 
the degree of twisting that might take place. 

Transverse shear stress distribution 

The through-thickness shear force in a beam is the integral of the shear stresses over the 
cross-section. In this section we will determine an expression for the shear stress distribution 
(transverse i.e. through-thickness) at a section as a function of the shear force at that position. 
Consider an element of beam length, 8x, as shown in Figure 3.73. The bending moment at x, 
section AC, is M and at x + 8x, section BD, is M + 8 M. The direct bending stresses on AC are, 
My 

°ac - ~Y~ 

where y = distance from the neutral surface 
I = second moment of area of the section 
and on BD, the bending stresses are, 

(M + 8 M)y 

°BD — j 

Thus, when the bending moment varies along the length of the beam on an element such 
as ABEF, also shown in Figure 3.73, there is a net axial force due to change in the bending 
stresses. The force on the face FB is the integral of the bending stresses over the area FB, 
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Figure 3.73 Element of beam length subjected to a varying bending moment 


Similarly the force on the face EA is 


Fba = l y ydA 


The net force to the right acting on element ABEF is the difference between these: 

Net force (bending) F EA = jf ~^ycbl (3.29) 

In order to maintain the equilibrium of ABEF, shear stresses must act on the plane EF, of 
average value t, as shown in Figure 3.74. These shear stresses are complementary to the 
transverse shear stresses. For positive transverse shear stresses, as shown, the complementary shear 
stresses act in the positive x-direction.The net force to the right due to these complementary 
shear stresses is 


Net force (shear) = t-z-8x 
where z is the width of the section at that depth. 

Now, the equilibrium of ABEF requires the net force due to bending to 
balance the net force due to the complementary shear. Thus, 


tz&x + J^^j^ydA = 0 
18 M f , . 

T = -k-^L rdA 


But, in the limit, 


lim 8 M _ d M 
8x — > 0 8x dx 


where S = shear force at the section 



= IJ> 


(3.30) 
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This is the general expression for transverse shear stress at any position y through the thickness. 
The integral can also be written in discrete form as follows, 

t = ~Ay (3.31) 


where A is the area of the part of the cross-section outside the position, y, at which t is 
determined, and y is the distance of the centroid of this area from the neutral axis, as 
shown in Figure 3.75. 


Determination of shear stress distribution for different 
cross-sectional shapes 

Rectangular section 

Referring to Figure 3.76 and using the discrete form for shear stress distribution, 
i.e. equation (3.31) above, we have, 



Figure 3.75 Definition of 
shear stress parameters in a 
cross-section 


A = l-j — yjb and y = (x + yj-z 


r (bd 3 / 12) ■ b V 

’-fM-r] 


(3.32) 


Note the parabolic distribution of shear stress (i.e. t varies with y 2 ), illustrated 
in Figure 3.77. Also, at the top and bottom of the section, where y = ±d/2, 
equation (3.32) gives t = 0. As expected, there is no complementary shear stress 
on the top and bottom free surfaces, therefore the transverse shear stress is also 
zero at these positions. 


At the neutral a 


= 6 ScP J 
bd 3 4 ^b 


This is the position of maximum shear stress whose magnitude is 1.5 X the average 
shear stress S/bd. 



section 


Note also that, in this analysis, the shear stress does not vary across the width of the section. 



Figure 3.77 Shear stress distribution in a 
rectangular section 


Circular section 

Figure 3.78 shows a solid circular cross-section of a beam. To calculate the transverse shear stress 
distribution in this section, we use the integral form of the shear equation, i.e. equation (3.30). 
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However, because of the circular shape, it is convenient to change the 
variables y and z in this equation to polar variables, R and 0. 
Referring to Figure 3.78 we have 

yi = R sinOj 
dy! = R cosOidBj 
Zi = 2 R cos0t 
z = 2R cos0 

and the second moment of area, I = ttD 4 /64 = ttR 4 /4 
The shear equation now becomes, 



-j: 


Rsin0 1 -2Rcos0 1 Rcos0 1 dl 


Figure 3.78 Solid circular cross-section 


ttR 5 cos0 Je 
4 S [ -cos 3 ( 


But 


4 S 

3ttR 2 


cos 2 0 


cos 2 0 = 1 — 


(3 2 


= 4S 
T 3ttR 2 


(3.33) 


Again, a parabolic distribution and the maximum value of t, at the neutral axis, 
when y = 0 is 


T mav (aty = 0) = 


4 S 

3t tR 2 


In this case, t must vary across the width of the section. As can be seen in 
Figure 3.79, at the free surface the shear stress must be zero. Therefore, the 
complementary shear on the cross section, normal to the boundary, is also zero. 
Thus, shear must be tangential to the boundary as drawn. 


l-section 

To determine the transverse shear stress distribution in an I-section, we need to 
consider the web and flange areas separately. 

Transverse shear in the web 



Figure 3.79 Shear stress 
distribution in a solid circular 
section 


Figure 3.80(a) shows an I-section and the position y where we wish to determine the shear 
stress. Using the discrete form of the shear stress equation we have 


t = j; A Y = ^MtFl + A 2Y2] 


S_ 

lb 


! + * +B f 
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BD 3 _ (B ~ b)d 3 

1 12 12 


The maximum t at y = 0: 


= sfjiDi _ (fi ~ fe)^ 2 | 

Tmax Jfcl 8 8 J 


At the bottom and top of the web, where y = ±d/2: 


r = ir# -P 2 -* 2 ) 


(3.34) 




Transverse shear in the flange 

Figure 3.80(b) shows the position y where we wish to determine the shear stress in the flange. 
Again, using the discrete form of the shear stress equation we have 


S_ 

Iz 


;Ay 


S_ 

lb 


= M— 

2/1 4 




At y = D/2, t = 0, as expected, i.e. zero shear complementary to the free surface. 
At y = d/2 


d 2 ) 


Comparing this expression with equation (3.34), there is a step change in t from the web to 
the flange of magnitude B/b, i.e. the ratio of the flange width to the web width. 

Figure 3.81 shows the transverse shear stress distribution down the centre line of the section 
and illustrates the step change discussed above. The shear in the flanges is small compared to the 
web and the shear stress in the web is approximately uniform with vertical position. Because of 
the small shear in the flanges, the average shear stress in the web is ~ S/bd, i.e. the shear force 
divided by the area of the web. 


The above distribution only applies down the centre line of the section. The shear stresses in 
the flanges are small and non-uniform across the width. This must be the case as they must be 
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zero at the top and bottom surfaces (i.e. free surfaces) of 
the flanges. There are, however, more significant shear 
stresses in the flanges, which act parallel to the flanges, 
i.e. horizontally. These can be determined by a similar 
analysis. 

Horizontal shear in the flange 

Figure 3.82 shows a small element, 8 x, of I-beam over 
which the bending moment changes from M to M + 
8 M.T 0 determine the hidden horizontal shear stress, t, 
at distance a from the edge of the flange, the equilibrium 
of an element of the flange is considered. Equilibrium of 
stresses acting on the element gives 



T 


t 

t 

t 


Z3\ 



t 

t 

t 

0 

J 


1+ ♦ ♦ 11 * *1 

2 


Figure 3.81 Transverse shear stress distribution in an l-section 


f (M + 81W) _ fM , „ 

L — 1 — yu ~ l~i ydA 


+ Tzdx = 0 


1 8 M 
Iz 8 x 


J> 


In the limit as 8 x 


8 M AM 
’ 8 x dx 


J>- 


i A r 


This is the same shear equation as before except that the 
interpretation of the quantities A, y and zr is different as 
shown in Figure 3.82. At a distance a from the edge of 
the flange, the horizontal shear stress is given by, 




Figure 3.82 Determining horizontal shear in the flange of an 
l-section 


= ff <P + d) 

t therefore varies linearly with a from zero at the flange 
edge to a maximum value at the flange centre (a = B/2), 

Vax = If (D+d) 

t is also parallel to the flange, i.e. horizontal. 

We can now draw the dominant shear stresses in both the 
flanges and the web. Figure 3.83 shows the distribution 
of these horizontal and vertical (transverse) shear stresses. 
The critical stress position is likely to be at the join of the 
web and flange where both the shear and bending stresses 
are high. 



Figure 3.83 Shear stress distribution in the flange (horizontal) 
and the web (vertical-transverse) of an l-section 
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Shear centre 

Consider the shear stress distribution in a symmetric, thin-walled channel section bending in 
the plane of the web as shown in Figure 3.84. 



Figure 3.84 Determining shear stress distribution in a thin-walled 
asymmetric channel section 


For the flange at distance a from the edge, the horizontal shear stresses are, 



analysed as above for the flange in an I-section. 

For the web at distance y from the neutral axis, the transverse shear stresses are 



We can now draw the shear stress distribution in both the flanges and the web, as shown in 
Figure 3.85(a). For this shear stress distribution, note that the shear stress in the upper flange 
is in the opposite sense to that in the lower flange, i.e. there is no horizontal resultant. Also, 
as there are no shear stresses on the free surfaces, the shear stresses act along the walls, i.e. 
horizontal in the flanges and vertical in the web. 
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Figure 3.85 Shear stress distribution in the flanges (horizontal) and 
the web (vertical-transverse) of an asymmetric channel section 


We can now look at the resultant forces arising from this shear stress distribution as shown in 
Figure 3.85(b). 
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The total shear force in the lower flange, S 1; is the integral of the shear stresses in this flange: 

" l‘ T,da - 

_ Sdtb 2 

4 1 

An equal and opposite shear force acts in the upper flange. 

The shear force in the web is approximately S, i.e. the total vertical shear load assuming thin 
flanges carry negligible vertical shear load. 

The resultant of all the shear stresses must be the vertical shear force S, and its line of action is 
distance e outside the web. Now taking moments about O in the web, 

S *-2S,f 

_ s t d _ d 2 tb 2 

e S 41 

It can be shown that the resultant of the shear stresses for a 
section, for bending in any plane, always act through one point, 
the shear centre. The shear centre always lies on an axis of 
symmetry. For sections with two axes of symmetry, the shear 
centre is at the centroid. 

If the applied vertical loads do not act in the plane of the 
resultant of the shear stresses, i.e. through the shear centre, then 
there is a torsional load on the section as shown in Figure 3.86. 

For arbitrary solid sections, the location of the shear centre is 
a complicated problem. However, it is not usually important 
to determine the shear centre for solid sections because such 
sections usually have a considerable torsional rigidity and twist 
very little due to bending loads. However, for thin-walled open 
sections, which have low torsional rigidity, the position of the 
shear centre may be very important. 


T= Pz~^ 
- -0 


Figure 3.86 Location of the shear centre in an 
asymmetric channel section 


Worked example 

Shear stresses in a beam 

The section shown in Figure 3.87 is 

subjected to a vertical shear force, 

S = 50 kN, acting down the vertical 
centre line, i.e. the y-axis. The second 
moment of area of the section, about 
the x-axis, which passes through the 
centroid of area, G, is 
l xx = 2.31 X 10 6 mm 4 . G is positioned 
14 mm below the flange. 

(a) Determine the magnitude of the 
transverse (i.e. vertical) shear 
stress at positions A, B, G and C 
on the vertical centre line. 


y a 

-80- 


■ 20 t 


LA 

r— 1 



Rj 


4 G 

l c 


80 x 

20 

. « iff! 



(b) Sketch the variation of the 

transverse shear stress down the 
vertical centre line. 


all dimensions in mm 

Figure 3.87 Worked example: a T-section 
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In the top flange 

Consider a position in the top flange, vertical distance y from the centroid, G, as shown in 
Figure 3.88(a). Using the discrete form of the shear formula, 

S S (34 + y) 

T ° = 2.31J0* ■ so • (8 ° ' (34 - r» — ~ 

= 4^li05< 1156 -l' 2 > 

= 0.0108(1156 - y 2 ) 

At position A, y = 34 .’. t = 0 

At position B, y = 14 .’. t = 10.4 N/mm 2 (MPa) 


III 





all dimensions in mm 

o o 

Figure 3.88 Determining the shear stress distribution in a T-section 


In the lower section 

jTj Consider a position in the lower section, again vertical distance y from the centroid, G, as 
" ,n shown in Figure 3.88(b). 

At position B, there is a step change in the shear stress given by the ratio of the section 
widths at this point: 

i = 1 °'| 0 80 = 20.8 N/mm 2 (MPa) 

At position G, i.e. the neutral axis, we can use the discrete formula for the shear stress. In 
this case, to simplify calculation, the relevant area can be regarded as the area below the 
neutral axis: 

T = lN = 23 iJ ? T 0-< 46 - 40 >- 23 

= 22.91 N/mm 2 (MPa) 

At position C, t = 0 i.e. a free surface. 

A sketch of the variation of the shear stress down the vertical centre line is now given in 
Figure 3.89. 

10.4MPa 

Jg 


Figure 3.89 Transverse (vertical) shear stress distribution in a T-section 
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Shear centre 

For the thin-walled semicircular cross-section 
shown in Figure 3.90, determine the position 
of the shear centre (assume bending about 
the axis of symmetry X-X) 

Shear stress distribution 

To solve this problem it is necessary to change 
from a rectangular coordinate system (x-y) to a 
polar coordinate system (r-0). Referring to Figure 
3.90 and using the integral form of the shear stress 
formula, we obtain a general expression for the 
shear stress distribution parallel to the wall of the 
section. Thus, 


■u* 


with y = Rcostf) 



semicircular cross-section 


AA = As - 1 = RtA§ 



giving, 



SR 2 sin0 
T = t 


1= J y 2 AA = j (R cos 0) 2 • Rd0 ■ t 
= J o R 3 t cos 2 8 A8 
1 J q ^(1 + cos20)d0 




From (3.35) and (3.36), 



SR 2 sin0 -2 
TT R 3 t 

2S sin 0 
tt Rt 


(3.35) 


(3.36) 
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The twisting moment (torque) associated with the above shear stress distribution for the 
whole cross-section is found by taking moments about O: 

Torque = J T • (RdO)t ■ R = sin 0 d0 



To counteract this twisting moment, as shown in Figure 3.91, the shear force, S, must be 
applied at the shear centre, a distance e, given by, 


S(e + R) 


4 SR 

TT 


e = — ~R = 0.273 R 


i 


Learning summary 


By the end of this section you should have learnt: 

/ in addition to longitudinal bending stresses, beams also carry transverse shear stresses arising 
from the vertical shear loads acting within the beam; 

/ how to derive a general formula, in both integral and discrete form, for evaluating the distribution 
of shear stresses through a cross section; 


/ how to determine the distribution of the shear stresses through the thickness 
circular and l-section beam; 


/ in an l-section, in addition to transverse vertical shear stresses 
dominant horizontal shear stresses also occur in the flange; 

/ the resultant of the shear si 


a rectangular, 

the flange and the web, more 
for a section always act through one point, the shear 


/ how to calculate the position of the shear centre; 

/ if the applied loads do not act through the shear centre, then there is a resultant torsional 
load which can result in twisting of the section if the torsional rigidity is low, e.g. in thin-walled 
sections. 
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3.8 Thick cylinders 

Introduction 

Thick cylinders differ from thin cylinders in that the variation of stress through the wall 
thickness is significant in thick cylinders when subjected to internal and/ or external pressure, 
but for thin cylinders, the variation of stress is negligible. Figure 3.92 includes drawings of thick 
cylinders with closed ends and with pistons. For closed-ended, internally pressured cylinders 
the axial force on the inside of the end closures produces a distribution of axial stress in the 
cylinder while for cylinders with pistons the resultant axial force in the cylinder, and hence also 
the axial stress, is zero. 




Figure 3.92 Thick cylinders subjected to internal pressure 


Analysis of thin cylinders 


For an internally pressurized thin cylinder situation, it is reasonable to assume that the 
variations of the stresses through the wall thickness are negligible. This results in the problem 
being statically determinate, i.e. expressions for the stresses can be obtained by consideration of 
equilibrium alone, as described below. 


2o - e rf = pdl 


TTd 2 

p^r 


= (T a Trdt 


pd pR 

CTe = 2 i = ~r 


pd pR 

(Ja = Tt = Hi 



p 


TTd 2 

4 



Figure 3.93 Free-body diagrams used to obtain hoop and axial stresses 
in thin cylinders 
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Analysis of thick cylinders 


Thick cylinder problems are statically indeterminate. Therefore, in order to obtain a solution it 
is necessary to consider equilibrium, compatibiHty and the material behaviour (stress-strain 
relationship). 

Assumptions: 

• plane transverse sections remain plane (this is true remote from the ends); 

• deformations are small; 

• the material is linear elastic, homogenous and isotropic; 


<r e = hoop stress 
c r z = axial stress 

Figure 3.94 A thick cylinder and a free-body diagram of an element of 
material within the cylinder 


Equilibrium 



o- e (8r8<r) 


(07 + + §7)8682: = a r (r 808 z) + 2o- 0 (8r8z) sinp^j 


For small 86 



Therefore: 


a r (r + 8 r )80 + -^8r(r + 8 r )80 = oyr 80 + CTg 8 r 80 


As 


m r + a r 8r H — ^Sr + "^Sr 2 = + cr 0 Sr 

d a, 

8r — > 0, —r~ dr 2 — * 0 


Therefore: 
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Compatibility 


extension 
original length 


Hoop strain, s e 


(r + w)80 — r80 

t80 



8r 


d u 
dr 


(3.38) 

(3.39) 


Axial strain, s z = constant 



an element of material 


(3.40) 


Material behaviour (stress-strain relationships) 

Generalized Hooke s law (linear elastic and isotropic) 

£ e = - v(ov + < t z )) 

£ r = e ( (T '~ v (°e + <u)) 

£* = jj(v z - v(a f + o- e )) 

Equations (3.37) to (3.43) have seven unknowns, i.e. u, erg, u r , cr z , Eg, s, and e z which ; 
functions of r, p 0 , p ; , R 0 , R„ v and E. 

Substituting u = rsg from equation (3.38) into equation (3.39) gives e r = ^(rsg) 
d£g 

i.e. £ = Eg + r-j— 

r H dr 

Using equation (3.41) and equation (3.42) in equation (3.43) gives 
lj(°V - v(a e + og) = 4 (ct 9 - v(<r r + a z )) + 


r I da e da r 
E \ dr V dr 


da A 

v ~d 71 


Using equation (3.40),— = 0, then equation (3.43) gives: 
da, da. da e 


(3.41) 

(3.42) 

(3.43) 

all 


(3.44) 


(3.45) 


(3.46) 
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Substituting (3.46) in (3.45) 


dog dov „ da. „ do 9 
(1 + v)a, = (1 + v)a 9 + r-r~- rv-rr - rv 2 ~rr - rv 2 -rr 


(Jq — d r = rv~ 


dr 

(1 + I’j'r, - (1 + v)ct 0 + r( 1 ~ - rv( 1 + 


d* - V)^T 

Substituting into equation (3.37) from equation (3.47) gives 

do,. do- e dcj r 

rv^-r(l-v)^ = r ^ 

\ do - , da e ] 

i.e. K! - v ) [^7 + ^r] = 0 

^(CT r +a e )=0 

i.e. u r + (Tq = constant = 2A, say 

da, 

ae " CTf = 

d cr r 

dcr, 
r ~& ' 


2cr r = 2A - 

2cr. = 2 A 


(3.47) 


(3.48) 


r dr v 


and using equation (3.48) leads t< 


Note that, since s z = const and (T r + a 9 = const, then equation (3.43) shows that o\ = const, 
i.e. it is independent of r.The value of (J, can therefore be obtained from a consideration of 
axial equilibrium. 


o"e ~ A + ^2 


The constants, A and B, are the so-called Lame s constants, which are the constants of 
integration, can he obtained from the boundary conditions, i.e. 

at r = Rj, cr r = -pi 

at r = R 0 , <T r = —p 0 

A B 

-p< = A -je- 

A B 

and ~p 0 = A - -^2 
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Hence, A and B can be determined 
For closed-ended cylinders: 

tr(R 2 — R, 2 )ct 2 + tt R 0 2 p 0 — 
R?Pi - R 0 2 p 0 

ic - = (R 0 2 - R 2 ) 

For a solid cylinder, i.e. R, = 0 

J3 _ 

°Y(r = Si): — ^4 q 2 — OO 


unless B = 0. 

Therefore, B must be zero, since the stresses cannot be infinite, and so, for a solid cylinder, the 
radial and hoop stresses are equal to each other and they are constant, 

i.e. a r = ct 9 = A 

Also, since a solid cylinder can only have external pressure, the constant A must equal the 
external pressure. 

Displacements are most conveniently obtained by using equations (3.41) and (3.43) together 
with equations (3.38) and (3.40), i.e. 

£e = 7 = E^ 6 ~ + CTz)) 


M 1 , , , .. 

e z = ~y = (c T z — v(cr r + a e )) = constant 

where l is the cylinder length, A/ is the increase in cylinder length and u is the radial 
displacement at radius r. 

Analysis of rotating discs 

Rotating components such as flywheels and turbine discs can be regarded as thick cylinders 
with body forces, as well as possible pressure loads, and represent an extension of the thick 
cylinder theory discussed in the previous section. 

Equilibrium 



Figure 3.96 Free-body diagram of an element of material within a disc 
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At a radius r, v = tor and a r = — = w 2 r, directed towards the centre of rotation, 
a, 78682 + 2a- e 8r8^^j — |a, + — j^8rj(r + 87)8682: = |p|r + -^7)808287] ^r 
a,r + a e 8r — ra, — a,8r — 7-7^87 — t 1 ^) 2 = p( r + ^ ra)2 

- a r - r-77 ~ -5787 = pr 2 (o 2 + plkj w 2 + pr8r(0 2 

Neglecting small terms, i.e. those containing 8r and (87) 2 
da, 

a e ~ a r= r-fo + prM ~ 

also 

a 2 = 0 


Compatibility and stress-strain relationships (linear elastic) 

£0 = 7 = 77 K “ va,) 

du 1 , , 

E r = d7 = ~ V(J «> 

Substituting for u from equation ( 3 . 50 ) into to equation ( 3 . 51 ) gives: 


d 

dr 

(1 to 

- va,)) 

= E^ 

r “ va e ) 



1 da (J 

da,) 


o-e “ 

va, + 7 

l“dT - 

V AI 7 J 

= a, — va e 




/ da e 

da,l 

to ~ 

a,)(l 4 

- V) + 7 

A dr “ 

V ^)=° 


Substituting for a 0 


a, from equation ( 3 . 49 ) into 
7 + pr 2 « 2 )(l + v) + 


equation ( 3 . 52 ) gives: 
da e da, 


da, da, „ . dan 

r — +rv — +{1+v)pr W + r-^ 


i.e., (7 (a 0 + a r ) = -(1 + v)pco 2 r 

a 9 + a, = -(1 + v)pary + 2 A 
Subtracting equation ( 3 . 49 ) from equation ( 3 . 53 ) gives: 
da. . r 2 

2a, + 7-77 = —(1 + v)pciry — pui 2 r 2 + 2 A 

Id,,, pw 2 7 2 (3 + v) _ , 

7Tr^r) = 7 + 


7 2 a, = — par (3 + v)^ + Ar 2 - B 


where B is a constant of integration. Therefore, 


B pw 2 (3 + v) , 
°f = A ~-? 8 r 


(3.49) 


(3.50) 

(3.51) 


(3.52) 


(3.53) 
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and from equation (3.53): 

„ , B , P“ 2 (3 + v) , pw 2 (l + v) ,, 
o"e — A + r 2 + g r 2 r 

. , B P“ 2 (l + 3v) , 

oe = A + -p. g r 


Worked example 

Thick cylinder with pistons 

A cylinder with 50 mm bore and 100 mm OD is subjected to an internal 
pressure of 400 bar. The end loads are supported by pistons which seal without 
restraint. Determine the distributions of stress across the cylinder wall. 

p = 400 bar = 400 X 100 kPa 
= 40 X 1000 kPa 
= 40 N/ mm 2 


TTTTTTTTTT 

P = 400 bar 

liiiiiiiil 



I 




Since there is no axial load on the cylinder, then cr^ = 0. 
For a thick cylinder, 


oe - A + ~2 


At r = 25 mm, <j r = 


-40 =A- 


B 

625 


At r = 50 mm, a r = 0 

0 = A ~ 2500 

EHminating A from equations (3.54) and (3.55) gives: 

40 = ^625 ~ 2500 ) 

MW) 

« 40 X 2500 

B - 3 

Substituting for B into equation (3.55) gives: 


(3.54) 

(3.55) 


0 = A - 40 X 7 
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Hence, 


and 


At r = 25 mm, 


and 


At r = 50 mm, 


and 






40 , 40 X 2500 40 /. . 2500' 

a ° = T + 3r 2 = Tl 1+ ~7^ 


40 40 X 2500 40/. 2500' 

a ' = T = 


CTq = -y X 5N/mm 2 = 66.7 N/mm 2 


(T r = -j- X (—3) N/mm 2 = —40 N/mm 2 


CTg = -y X 2 N/mm 2 = 26.7 N/mm 2 


(T r — 0 



Worked example 

Shrink/interference fit 

A pair of mild steel cylinders (E = 200 GPa) of equal length have the following 
dimensions: 

(1) 40 mm bore and 80.06 mm outside diameter 

(2) 80 mm bore and 120 mm outside diameter 

i.e. there is a diametral interference of 0.06 mm. The larger cylinder is heated, 
placed around, and allowed to shrink onto, the smaller cylinder. Calculate the 
stresses after assembly. 
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Conditions 

• After assembly, the radial interference pressure, p, will be the same on both cylinders, 
i.e. cylinder 1 will have an external pressure, p, and cylinder 2 will have an internal 
pressure, p, as indicated in Figure 3.99. 

• The decrease in the outside radius of cylinder 1, q, plus the increase in the inside 
radius of cylinder 2, i 2 , will be equal to the radial interference, i.e. i = q+ z 2 . 

• Axial stresses are assumed to be zero 
(or negligible). 


I 

i 



Figure 3.99 

For cylinder 1 : 

a B 1 

(J r = A i 2" 

and 

B, 

°e - A i + ~jl 

at r = 20 mm, a r = 0, 

Bi = 40(Mj 

at r = 40 mm (no significant difference with 40.03 mm), cr, = —p 
400 . 


P ~ A \ 402^1 ~ A \ 


1600 1 
A \ = 'TP 


1600 




w.- 


£fl = 7 = s( CTe “ V ( CT ' + = E ((Te “ VCTr) 

At the outside of cylinder 1, r = 40 mm, 

~‘i _ 1 _ . 

40 2OOOOO (CT0 VCTr) 
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za, i I _ — ] [ t + -trro — v( ! - 

40 200 000 \ 3 A 1600 \ 

■ 8 P ( 5 3v\ 

h 30 000 U 4/ 


400 y 

1600/y 


For cylinder 2 


B 2 

CT e - ^2 + r 2 


B 2 = 3600 A 2 


_ n = A _60!. =A _ 3600 
F A 4 q 2^2 ^2 160Q 4 


B 2 = 3600 X JP 


w 2 = |(i-^) 


K)2 = xt 1 + 


3600 j 


e e = 7 = ^ (o’® - v(a r + o- 2 )) = ^(ct 0 - u 


At the inside of cylinder 2, r = 40 mm, 

+ '2 _ 1 /4M/ 3600 / 

40 200000 \ 5 /\ 1600 V \ 

■ Sp { U 5vN l 

1-e - * 2 50 000 1 4 4 / 


3600 V 
“ 1600// 


x(13 + 5v) 


But i x + i 2 = i = 0.03 mm 
2 P 


10ff 


(5 - 3V) + g 
2yp 


7(13 + 5v) = 0.03 


26 P 


2 vp 


30 000 10000 50000 10 000 

50 p + 78 p 


150 000 
y-N/mm 2 = 35.2 N/mm 2 


= 0.03 
= 0.03 
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Worked example 

A turbine rotor disc with an angular 
velocity of 4000 rpm has an external 
diameter of 1.2 m and has a 0.1 m 
diameter hole bored along its axis. 
Determine the stress distributions 
in the disc. 

Take p = 7850 kg/m 3 and v = 0.3. 

. B P^ 2 (3 + v)r 2 

°r = A -? § 

B pu^l + 3 v)r 2 
oe - A + 72 “ § 

at r = 50 mm, cr r = 0 



B 7850 X 10~ 9 


X (4000 X (3 + 0.3) X 50 2 X 10 -3 N/mm 2 


r= 600 mm, cr r = 0 
B 

“ 600 2 
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Subtracting equation (3.56) from equation (3.57) 

Jo-i^' 204 ' 54 - 1 ' 42 

B= 511 350.0 

and 

A = 205.95 

Hence, 

cr. = 205.95 - 51 1 2 350 - 0.000 568r 2 


and 


r = 50 mm, ay = 0 and a e = 409.7 N/mm 2 

r = 600 mm, <7, = 0 and a e — 89.6 N/mm 2 
The maximum cr r value could be obtained by 
dcr r 

”d7 = 0 

to find the r - value and then substituting this value of r into expression for CT r 



Learning summary s 

By the end of this section you should have learnt: 

/ the essential differences between the stress analysis of thin and thick cylinders, leading to an 
understanding of statically determinate and statically indeterminate situations; 

/ how to derive the equilibrium equations for an element of material in a solid body (e.g. a thick 
cylinder); 

/ the derivation of Lame’s equations; 

/ to determine stresses caused by shrink-fitting one cylinder onto another; 

/ to include ‘inertia’ effects into the thick cylinder equations in order to calculate the stresses in a 
rotating disc. 

V 
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3.9 Asymmetrical bending 

M a E 

The beam bending equation, — = — = has been derived and is generally used to 

determine stresses in a beam with a symmetrical cross section. The symmetry is usually about 
an axis perpendicular to the neutral axis of the section. For a section where this symmetry 
does not apply, i.e. asymmetric sections, a complication arises, making bending analysis more 
difficult. In these cases, applying a bending moment about the neutral axis will, in general, 
result not only in bending about that axis but also in simultaneous bending about the 
perpendicular axis; there is an interaction effect. To analyse such sections we introduce a new 
geometric quantity called the product moment of area and this leads to the concept of 
principal second moments of area and principal axes for the section. These are axes for 
which the product moment of area is zero and the above interaction effect during bending 
does not occur. Thus, it is convenient to analyse the bending of asymmetric sections about 
these axes. In this section, we will look at the theory behind this effect and develop a general 
procedure for dealing with asymmetrical bending situations. 


Second moments of area of a complex shaped cross section 

Second moments of area about parallel axes 

Consider an arbitrary shaped cross section, as shown in Figure 3.103. 

The centroid of area, G, is at the origin, O, of the O-x-y axes set. 

A parallel axes set, O'-x'-y' , also exists, distance a and b from the 
O-x-y, as shown in the figure. The centroid of area, G, is positioned 
at coordinates ( x',y ') = ( a ,b) in this parallel axes set. 

We know that the second moments of area, I x and L, of the section 
with respect to the x- and y-axes are given by, 

I x = f y 2 dA 

and 

Iy= J x 2 dA 

i.e. the product of an element of area, cL4, and its distance squared from the particular axis 
(x or y), integrated over the full cross-sectional area, A. 

The parallel axis theorem allows the calculation of the second moments of area, I x < and ly, 
with respect to the x'- and y'-axes as follows, 

1 x . = I x + Ab 2 ( 3 . 58 ) 

and 

ly = I y + Aa 2 ( 3 . 59 ) 

I x and L are the second moments of area about a set of axes through the centroid and are 
always the minimum second moments. 1/ and ly will always be greater because the second 
terms in equations (3.58) and (3.59) are always positive as the distances between the axes, a and 
b, are squared. 



Product moment of area 

We now introduce a new quantity, the product moment of area, l xy , which is defined as, 

= !/y dA 
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I xy is the summation of the elements of area times the product of their coordinates. We can now 
develop the parallel axis theorem for the product moments of area as follows, 


4y = \x’y'dA = J (x + a)(x + b)dA 

= J xydA + a J ydA + bj xdA + abj dA 


but | ydA and I xdA are both zero because the origin of axes Oxy is at the centroid of area, G. 

»a Ja 

Thus, 


4y = 4 y + abA ( 3 . 60 ) 

This is the product parallel axis theorem. Again, I xy is the product moment of area about a 
set of axes through the centroid. In this case, 4y > can be either positive or negative, depending 
on the signs of a and b. 


Symmetric sections 

Figure 3.104 shows a section where one axis (the y-axis in this case) is an axis 
of symmetry. The sum of the contributions to the product moment of area from 
elements of area, cL4, on opposite sides of the axis of symmetry will cancel out 
because of the change of sign of the x-coordinate.Thus, in general, if a section has 
an axis of symmetry, then I xy is zero. 


Rotation of axes 

Referring to Figure 3.105, the question arising is that, given I x , I y and I xy for a 
particular section, what are the second moments of area I u , I v and I uv about a set of 
axes, Ouv, through the same origin but rotated through an angle, 0. 


The coordinate transformation equations for rotation through an angle, 0, are 
u = x cos 0 + y sin 0 
v = —x sin0 + y cos 0 

using these equations, we can now derive the second moment of area about the 
w-axis, 4, as follows, 


but 


and 


4 = j^f 2 cL4 = f A (x 2 sin 2 0 + y 2 cos 2 0 — 2xysin0cos0)cL4 
= 4 sin 2 0 + 4 cos 2 0 - I xy sin 20 

cos 2 0 = j(l + cos 20) 

sin 2 0 = |(1 - cos 20) 





4 = |(4 + 4) + i(4 - 4) COS 20 - 4 y sin 20 

( 3 . 61 ) 

Similarly, we 

can show that 



4 = f(4 + 4) - f(4 - 4) cos 20 + 4y sm 20 

( 3 . 62 ) 


Equations (3.61) and (3.62) differ only by the signs of the last two terms, so adding them 
together gives 

4 + 4 = 4 + I y = Jjx 2 + y z )dA = j A r 2 dA = J 
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This is a statement of the perpendicular axis theorem, i.e. the sum of the second moments 
of area about two perpendicular axes is equal to the second moment of area about the third 
perpendicular axis. In this case, the latter second moment of area is the polar second moment 
of area, J. 

Now considering the product moment of area, I m , we have, 

4„ = J uvdA = J (xcosQ + ycos0)(— xsin0 + ycos0)cL4 

= J [— x 2 cos0sin0 + y 2 sin0cos0 + xy(cos 2 0 — sin 2 0)]cL4 

I m = 1(4 + Iy) sin 20 + I xy cos 20 ( 3 . 63 ) 

Equations (3.61), (3.62) and (3.63) enable us to calculate I u , I„ and I uv , knowing I x , I y and I xy and 
the angle of rotation, 0. 

Principal second moments of area 

Now, looking more closely at equation (3.63). If 0 changes by 90°, 20 changes by 180° and 
I uv changes its sign but not its magnitude. Thus, since it is a continuous function of 0, I uv must 
have zero value. 

The axes which give rise to a zero product moment are called the principal axes. 

For simplicity, let us assume that x and y are the principal axes. Then, 

4 = I p , I y = 4 and 4 y = I pi = 0 

where I„ and L are called the principal second moments of area 
Substituting I p and I q into equations (3.61), (3.62) and (3.63) we have, 

I u = kl p + I q ) +1(4 -4) cos 20 
4 = |(4 + 7,)-|(7 p -4)cos20 

4, = i(4 ~ 4) s,n2 0 

These equations can be represented by a Mohr’s circle as shown in Figure 3.106. Second 
moments are plotted on the x-axis and the product moments are plotted on the y-axis. 

Note The y-axis for the circle is positive upwards, unlike Mohr’s circle for stress, which is 
positive downwards for shear. 



Point A on the circle has coordinates that correspond to the second moment and product 
moment, i.e. ( I u , J„„), relevant to the u-axis. Point B on the circle has coordinates which 
correspond to the second moment and product moment, i.e. (/„, I vu = — l uv ) , relevant to the 
v-axis. These two points enable the circle to be drawn. 
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The centre of the circle, C, and radius, R, are given by, 

I u + I v 

centre C = — x — 


(3.64) 


radius, R = 





+ h/ 


(3.65) 


The points P and Q on the circle correspond to the principal planes for which the product 
moment of areas are zero and the second moments are the principal second moments of area, I p 
and /j. Their magnitudes are given by 


where the centre and radius are given by equations (3.64) and (3.65). 

Thus knowing I u , I v and I uv , the principal second moments of area, I p and I q , can be determined. 

The angle of the principal axes with respect to the u-v- axes is the angle 6, where 20 is shown 
in Figure 3.106 and is given by, 



or alternatively 


Key points about the Mohr’s circle for second moments of area: 

(1) The positive upward direction for the product moment ensures that rotation in the Mohr’s 
circle has the same sense as the rotation of the axes in space. 

(2) I p and I q are both positive. 

(3) If I p = I q , all product moments are zero and all axes in all directions are principal axes, e.g. 
this is the case for a circular section. 

(4) The sign of the product moment is important. I uv = I uvdA is associated with the w-axis 

Ja 

and can be positive or negative. The product moment associated with the f-axis is I m = ~I m . 

Summary of procedure for calculating the principal second moments of area and the directions of the 
principal axes: 

(1) Divide the cross section into subsections for which their centroid of areas and second 
moments of area about their own axes can be determined. 

(2) Choose a convenient set of orthogonal axes with its origin at the centroid of the full cross 
section. 

(3) Use the parallel axis theorem to determine the second moment of area for the full cross 
section. 

(4) Use a Mohr’s circle construction to determine the principal second moments of area and 
the directions of the principal axes. 


I p = centre + radius 


and 


Iq = centre — radius 


tan 20 = 
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Worked example 

Principal second moments of area 
Figure 3.107 shows an asymmetric 
angle cross-section. Determine: 

(a) the principal second moments 
of area 

(b) the directions of the principal 
axes. 


The section is divided into two 
rectangular subsections 1 and 2. 

Position of the centroid: 

total area = 51 


B 

A 

—25.5 

y, 

— 5 

1 *- 

A 

I 

37 




1 l 10 


i 



G 

16.46 

j 

m 

X 







16.M 

mensions in 


2 




All dh 


9^96 \ 


B 

< 10 > 

10.54 


Figure 3.107 Second moments of area of an 
angle section 


X 10 + 54 X 10 = 1050mm 2 


Taking moments of the areas about the datum AA, 

1050 X y = (51 X 10) X 5 + (54 X 10) X 37 
y = 21.46 mm 

Taking moments of the areas about the datum BB, 

1050 Xx=(51 X 10) X 25.5 + (54 X 10) X 5 
x = 14.96 mm 


Second moments of area about a convenient set of axes: 

The x and y axes are drawn as a convenient set of axes through the centroid. Using the 
parallel axis theorem, 


= 51 X 10 3 

* l 12 

= 404 051 mi 

. ( 10 X 51 3 

1 l 12 


+ 51 X 10 X 16.46 2 + 


10 X 54 3 

l 12 


+ 10 X 54 X 15.54 2 


+ 10 X 51 X 10.54 2 + 


- + 54 X 10 X 9.9b 2 } 


= 225 268 mm 4 


And the product parallel axis theorem, 

I xy = (0 + 51 X 10 X 10.54 X 16.46) + (0 + 54 X 10 X (-9.96) X (-15.54)) 
= 172059 mm 4 

Note that, in the product moment of area calculation above, the product moment of 
each subsection about its own axis is zero due to the symmetry of each subsection. It 
is also important that the correct sign for the coordinates of each subsection centroid 
with respect to the full cross-sectional centroid are taken. Thus, for subsection 1, the 
coordinates are both positive (10.54 and 16.46), while for subsection 2, they are both 
negative (—9.96 and —15.54). 
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◄ 


Mohr’s circle: 



Product 

moment 



A (404, 051,172,059) 


2 nd moment 


(225,268,-172,059) 


Figure 3.108 Mohr’s circle 


A Mohr’s circle can now be drawn to represent the axes about which I x , I y and I xy act, as 
shown in Figure 3. 108. The centre and radius are calculated as follows: 


Principal second moments of area: 

The principal second moments of area can now be determined from the circle as follows: 
I p = C + R = 508 554 mm 4 
I q = C- R= 120764mm 4 

and the angle, 0, of the principal axes with respect to the x-axis is given by 


From the Mohr’s circle it can be seen that the principal axis 1, i.e. the p-axis is 31.27° 
clockwise from the x-axis. The principal axes can now be drawn on a sketch of the 
element as shown in Figure 3.109. 


4 + 4 

centre C = — ^ — = 314659 mm 4 


J 




I 



0 = 31.27° 



Figure 3.109 Angle of principal axes 
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Bending of beams with asymmetric sections 

Figure 3.110 shows an arbitrary cross section of a beam subjected to a bending moment, M, 
acting at an angle 0 to the x-axis. The origin of the x-y-axes coincides with the centroid of the 
section. The bending moment has two components, M x and M y , as shown, acting about the x- 
and y-axes respectively. 


Note The bending moment and its components are drawn in vector form with a double 
arrowhead. The right-hand screw rule defines the sense of the bending moment component as 
shown in the figure inset. 


Assume that bending takes place about the x-axis, i.e. O-x is the neutral 
axis. Then, the bending stress, ct, is proportional to the distance, y, from 
the neutral axis, or alternatively 


ct = c-y 

where c is an arbitrary constant. 

The resultant moment about the x-axis is given by the sum of moments 
of the forces acting on each elemental area in the cross section. In the 
limit, this sum can be written as an integral: 

M x = J cr ■ ycL4 
= J A c-yydA 



Figure 3.110 Bending about an arbitrary 
cross-section 


where I x = second moment of area about the x-axis. 
But 


c = cj/y 
M x 

= T y 

which is the beam bending equation as expected. 

However, there is also a resultant moment about the y-axis: 

M y = — J a ■ xcL4 

= — J c-y xcL 4 

= -cI xy 

where 4 y = JxydA is the product moment of area 


Note The negative sign arises because a positive stress results in a negative moment about the 
y-axis. 

Thus, in general, a moment has to be applied about the y-axis as well as the x-axis to 
produce bending about the x-axis only. A positive moment is required about the y-axis to 
counterbalance the negative moment set up by the stresses arising from M^.This is not the case 
if I xy is zero, i.e. for sections that are symmetric about the y-axis. 

To ensure bending about the x-axis only, a resultant moment 
M = JM 2 X + M 2 y 
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must be applied at an angle, 0, given by 



The resultant moment is only applied about the x-axis when I xy = 0. 

Figure 3.111 illustrates the effect for a zr-section. If a bending moment is 
applied about the x-axis only, then the stresses in the flanges will create a 
resulting moment about the y-axis. Consequently, bending will take place 
about both the x- and y- axes. This is a consequence of I xy not being zero for 
this asymmetric section. 

To avoid this moment coupling effect, it is usually convenient to solve 
bending problems by considering bending about the principal axes of a 
section for which the product moment of area is zero. 


Solving asymmetrical bending problems 

Consider the arbitrary asymmetric section shown in Figure 3.112(a). O is the 
centroid and O-p and O-q are the principal axes of the section. The principal 
axes are inclined at an angle 0 to the x-y-axes. Components of an applied 
moment M, i.e. M x and M y , act about the O-x and O-y axes respectively. 
Firstly, M x and M y are resolved onto the principal directions, as illustrated in 
Figure 3.112(b), giving, 


Tensile 


y. 


Compressive 



Figure 3.1 11 2 -section 


and 


M p = M x cos 0 + M y sin 0 


M f = ~M X 


0 + My cos 0 



o 

Figure 3.112 Asymmetrical section 



o 


We can now calculate the total bending stress at any position, ( p,q ), which arises from these two 
bending moment components and is given by 

M-q M-p 

<j b = ~ T f- (3.66) 

p 

Note When p and q are both positive, i.e. in the first quadrant of the p-q axes set, a positive, M p 
gives rise to a positive bending stress while a positive M„ gives rise to a negative bending stress. 
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The maximum stress in the section will occur at the extreme distance from the neutral axis. We 
therefore need to determine the position/ orientation of the neutral axis which can be found by 
setting the bending stress, i.e. equation (3.66), to zero.Thus, the neutral axis occurs where, 

M p -q M q -p 

= ~ p J^ = ° 


M p'<l _ M q'P 
I p ~ !q 
i= Mq Ip 
V M p I q 


This value for q/p defines the angle, a, of the neutral axis, with respect 
Figure 3.112(a), as follows, 


a = 


Uvi. 


the p-axis, shown in 


(3.67) 


Equation (3.66) can therefore be used to determine the magnitude of the stress at any position 
( p,q ), and equation (3.67) can be used to determine the orientation of the neutral axis and 
hence the position of the maximum stress, which is at the extreme distance from the neutral 


Summary of the procedure for solving asymmetrical bending problems: 

(1) Determine the principal axes of the section, p and q, about which I xy = 0. 

(2) Consider bending about the principal axes, i.e. resolve bending moments onto these axes. 

(3) Knowing M p , M q , I p and I q , determine the general expression for the bending stress at 
position (p, q) as follows, 


(4) Determine the angle of the neutral axis with respect to the p-axis: 
.ta\ JM-IA 


(5) Evaluate the bending stress at any position in the section including the extreme positions 
from the neutral axis which give the maximum bending stresses. 


Worked example 

Asymmetrical bending 
The angle section, shown in Figure 
3.113, with principal axes and 
principal second moments of area 
indicated, is subjected to a bending 
moment of 300 Nm about the 

Determine: 

(a) the position/ orientation of the 
neutral axis 

(b) the bending stresses at positions 
A, B and C. 



Figure 3.113 Bending about an asymmetrical 
angle section 
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Resolving the applied moment: 



o o 


Figure 3.114 Transformation onto principal axes 


Referring to Figure 3.114(a), the components of the applied bending in the p- and 
q- directions are 

M p = M cos(31.27) = 0.855M 
M q = M sin (3 1.27) = 0.519 M 

Bending stress equation: 

The general expression for bending stress at position (p,q ) is, 

Mp-q M q -p 

° h = ~^r V 

300. 10 3 0.855q 300.10 3 0.519p 

508 554 “ 120764 


= 0.5042^ - 1.2894? 

Note that for p and q in mm, this expression gives bending stress in N/rnm 2 , i.e. MPa. 
Orientation of the neutral axis, a, with respect to the p-axis: 


M 


= t^-i/ 1-2894 1 _ 
1 0.5042/ 


_1 (2.558) 


= 68.64° 

Orientation of the neutral axis with respect to the x-axis = 68.64 — 31.27 = 37.37°. 
This orientation is illustrated in Figure 3.113. 


Bending stresses: 

To determine the bending stresses at A, B and C, we need the p and q coordinates of these 
points. Referring to Figure 3.114(b), the general coordinate transformation equations for 
a set of axes ,p-q, inclined 0 anticlockwise from another set, x-y, are, 


and 


p = x cos 0 + y sin 0 


q = — xsin0 + y cos 0 


For this problem, the p-axis is inclined at 31.27° clockwise to the x-axis. Thus, 
0 = —31.27° and the above transformation equations become 


p = 0.8547X - 0.5191y 

and 


q = 0.5191* + 0.8547y 
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◄ 


We can now draw a table for calculating the coordinates of A, B and C as follows 



and the stresses follow from the general equation a h = 0.5042 q — 1 ,2894p, as follows: 
at A 


ct a = 36.33 MPa 


I 


atC 


ct b = -6.67 MPa 


ct c = -34.24 MPa 


Learning summary 

By the end of this section you should have learnt: 

/ an asymmetric cross section, in addition to its second moments of area about the x- and y-axes, 

4 and I y , possesses a geometric quantity called the product moment of area, I xy , with respect to 
these axes; 

/ how to calculate the second moments of area and the product moment of area about a 
convenient set of axes; 

/ an asymmetric section will have a set of axes at some orientation for which the product moment of 
area is zero and that these axes are called the principal axes; 

/ the second moments of area about the principal axes are called the principal second moments of 
area; 

/ how to determine the second moments of area and the product moment of area about any 
oriented set of axes, including the principal axes, using a Mohr’s circle construction; 

/ it is convenient to analyse the bending of a beam with an asymmetric section by resolving bending 
moments onto the principal axes of the section; 

/ how to follow a basic procedure for analysing the bending of a beam with an asymmetric cross 
section. 


3.10 Strain energy 

We saw in Section 3.4 how deflections of a beam can be determined by solving the 
differential equation of the elastic line, using Macaulays method. This method is not 
appropriate for more complex shaped structures or bodies or where combined loading is 
applied. In this section we introduce the concept of strain energy which will enable us to 
calculate deflections of complex structures. When an elastic body is subject to loading, strain 
energy is stored within the body. An Italian railway engineer, called Carlo Alberto Castigliano, 
derived a theorem and procedure for using the strain energy to determine deflections in 
structures or bodies. Castigliano s theorem is a powerful and flexible method for solving 
deflection problems. 
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Strain energy expressions 

The strain energy in an elastic body is equal to the work done on the 
body by the applied loads. Thus, when an elastic body is subjected 
to a single applied load P, causing it to displace by a distance u at the 
position of load application, as illustrated by Figure 3.115, the strain 
energy (denoted by the symbol capital 17) is given by, 

strain energy, 17 = work done = I Pdu 
Jo 

For a linear elastic body, 

17 = jP- u (3.68) 

where P = final load 
and u = final displacement 

Similar expressions can be derived for moments, torques and couples. Thus, when a moment is 
applied to an elastic body, causing it to rotate through an angle 0, the strain energy is given by, 

17 = \M- 0 (3.69) 

where M = final moment 
and 0 = final angle of rotation 

We will now use equations (3.68) and (3.69) to derive expressions for the strain energy in a 
bar under tension, torsion and bending. 



Figure 3.115 Strain energy given by area 
under load-displacement curve 


Tension in a bar 

Figure 3.116 shows an element of the length of a bar, length 8s, 
cross-sectional area, A, which is extended 8w due to a tensile load P. 

The strain energy for the element, 817, is given by, 

817 = work done = \ Pdu (3.70) 

Note: There are transverse strains/displacements due to Poisson’s 
effect but no transverse stresses/loads. Thus, there is no work done 
in the transverse direction. 





Figure 3.1 16 Strain energy in a bar under tension 


Now, 


strain e = extension/ original length = 


Also, by Hooke s law 


Substituting (3.71) into (3.70) gives 


_ a _ P 
~ E~ EA 

8 u _ P 
8s “ EA 



(3.71) 


(3.72) 


This is the expression for strain energy in an element of bar. Thus, for a bar of length L, with 
possibly varying A and E, integrating (3.72) gives 

/ P 2 


17 


r p 2 
Jo 2AE 


ds 
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which is the general expression for strain energy in a bar under tension. This expression can 
also be used for a bar under compression. 

For constant A and E, 


(3.74) 


So, knowing the load, geometry and material stiffness, we can use expression (3.72) or (3.73) to 
determine the strain energy in the bar. 


Strain energy per unit volume 

It is interesting to note that, from (3.68), the strain energy per unit volume is given by. 


W 1 P8» 

8 F 2A§s 


E 


strain (for uniaxial stress) 


Torsion in a shaft 


86 

C(- (*}G , 

h * H 

Figure 3.117 Strain energy in a shaft unc 

Figure 3.117 shows an element of the length of a shaft, length 8 s, that is twisted through angle 
80 due to a torque T. 

The strain energy for the element, 8 U, is given by 

8 U = work done = \ T 80 (3.75) 

Now for a circular shaft of polar second moment of area, J, and shear modulus, G, the torsion 
equation, gives 



80 = 7^8 s (3.76) 

Substituting (3.76) into (3.75) gives, 

SL7=^8s (3.77) 

This is the expression for strain energy in an element of the shaft. Thus, for a shaft of length L, 
integrating (3.77) gives 

u ~Cm ds i 3 - 78 ' 


which is the general expression for strain energy in a shaft under torsion, and you can see the 
similarity in form with equation (3.74) for a bar under tension. 
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Bending of a bar 

Figure 3.118 shows an element of a straight beam, length 8s, which bends to curvature R, due 
to an applied bending moment M.The angle subtended by the element of beam is &4>, also 
equal to the change in slope of the beam over 8s. 


R 



Figure 3.118 Strain energy in a beam under bending 


The strain energy for the element, 8(7, is given by, 

8 U = work done = jM8<|> (3.79) 

Now, R ■ 84> = 8s and = -j|, the beam bending equation. 


8s _ 

-R~EI hs 


Substituting (3.80) into (3.79) gives, 


8(7 = 


M 2 c 
2 E/ 8 - 


Thus, for a beam of length L, integrating (3.81) gives, 


U = 


m* 


(3.80) 


(3.81) 


(3.82) 


i.e. the general expression for strain energy in a beam under bending, and again you can see the 
similarity in form to equations (3.73) and (3.78). 

The three expressions for calculating strain energy can now be summarized as follows, 




In practical engineering structures, where members are relatively long and slender, strain energy 
due to tension and compression can usually be neglected. Bending is usually dominant. Also, 
strain energy due to shear deflections can exist but, again, it can normally be neglected. 


Castigliano’s theorem 

Consider an elastic body loaded by forces, P b as shown in Figure 3.1 19(a). The corresponding 
displacements of the load points in the direction of the loads are For the general non-linear 
elastic case, the load— displacement curve for the ith load is shown in Figure 3.119(b). 

If all forces are applied simultaneously and proportionately, the total stored strain energy, U, in 
the body, is equal to the work done by the forces as follows, 

u -E u '-El p ‘ du ‘ 

where (7, is the area under the load— displacement curve for one loading. 
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o o 

Figure 3.119 (a) Elastic body subject to several loads and (b) load- 
displacement curve for the /th load acting on the body; 17, • is the strain 
energy and U- is the complementary energy 


We can also define the complementary energy, U*, as follows, 

U * - E U 7-E ! u ‘ dp ‘ 

where U*is the area above the load-displacement curve for one loading. 

Now, consider a small increment of the load P„ A P„ while the other loads remain constant. This 
causes an increment of the complementary strain energy, shown in Figure 3.119(b) and given by 

A U* = J UjdPj ~ m,AP, 

For the other loads, Pj,j # i, Pj is constant and 

AL7/ = J UjdPj = 0 

The total change in complementary strain energy is therefore, 


In the limit, 
and 


U*= ^A(7* = A 17* = m,AP,- 
A P, => 0 


lim A U* = dlf 
A P, = 0 A P, dP, 


(3.84) 


Note that this is a partial derivative because all other loads except P,- were kept constant. 

Equation (3.84) is the general form of Castigliano’s theorem, which, as stated in the 
introduction, is named after the Italian railway engineer, Carlo Alberto Castigliano (1847— 
1884), who developed the method. His theorem states that the deflection, at a given load 
point t, may be obtained by differentiating the complementary strain energy, U , with respect 
to the load, P„ acting at the point. 

Note that «,• is the deflection at the point of application of the load, P„ in the direction of P„ 
and P, is independent of other loads. 

Equation (3.84) applies to any non-linear elastic body. For linear elastic bodies, strain energy is 
equal to the complementary strain energy as shown in Figure 3. 120. Thus, for a linear elastic 
body, 
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and 


U= U* 


= £IZ 

U ‘ dPj 


(3.85) 


Equation (3.85) is the well-known form of Castigliano s theorem and states that the 

partial derivative of the strain energy, U, of a linear elastic system with respect 
to any independent force is equal to the displacement of the structure at the 
point of application of the force in the direction of the force. 

Castigliano s theorem may be extended to include rotations due to moments and torques: 


dU 

dM t 


(3.86) 



where 0, is the rotation at the point of application of the moment M t about the axis 
of M ; . 


Figure 3.120 Load-displacement 
line for a linear elastic body 


The above forms of Castigliano s theorem (equations (3.85) and (3.86)) can be used to 
determine deflections/ displacements and rotations at a point in a body or structure where a 
load or moment is applied. The procedure is as follows: 


(1) Obtain expressions for the total strain energy within a structure or body in terms of the 
applied loads or moments using equations (3.83). 

(2) Differentiate the strain energy expressions with respect to the applied loads or moments, 
as in equations (3.85) and (3.86), to determine expressions for deflections or rotations 
respectively. 

(3) Evaluate numerically the deflections and/ or rotations. 


Notes on further use of Castigliano’s theorem 

(1) To determine the deflection or rotation at a point where 
a load or moment is not applied, include a dummy load 
or moment at the point. Obtain and differentiate a strain 
energy expression incorporating the dummy load/moment, 
and set the latter to zero when numerically evaluating the 
deflections/rotations. Thus, to determine the horizontal tip 
deflection for the structure shown in Figure 3.121(a), with 
an applied vertical load, P, it is necessary to add a dummy 
load F at the tip. Then, 

= 

Up \dFj(F= 0) 

(2) If the displacement or rotation at a point is restrained, 
e.g. at a support as shown in Figure 3.121(b), and the 
reactions are R 0 and M 0 , then, 



] 


o o 

Figure 3.121 Castigliano’s theorem: (a) dummy load, 
F, added to determine the horizontal deflection and 
(b) use of the zero displacement condition to solve 
for unknown reaction loads and moments 


These conditions can be used to determine the reactions R 0 and M 0 . 
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Worked examples 

Beam deflection 

Figure 3.122 shows a simply supported beam, ABC, with a point load at the 
centre of its span. 

Use strain energy to derive an expression for the central deflection of the beam 
(assuming bending strain energy only, no shear). 

C 


Rc 

Figure 3.122 Simply supported beam with 
point load at centre span 


- — — m — - 

Y w 



Reaction forces: 

From symmetry, 

Ra = Rc= W/2 

Bending moment distribution: 

Taking a section, X-X, in the part of the span, AB, and considering the left-hand part of 
the beam as a free body. Moments about X-X give, 


M = 



Strain energy: 

The strain energy stored in AB is given by, 



f L/2 W 2 x 2 

Jo 4.2EI 


Wf\x i] V 2 
8EI [ 3 Jo 

W 2 L 3 
192 El 


From symmetry, 


The total strain energy 



U tot = 2 U A 


W 2 L 3 
96 El 


Deflection: 

The central deflection, i.e. deflection at the load, is therefore given by, 

= dR = Ml 
Yb dW 48 El 
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Dummy load 

A curved beam, whose geometric form is a quadrant in the vertical plane, as 
shown in Figure 3.123(a), has cross-sectional dimensions small compared with 
the quadrant radius. 


Use strain energy to derive an expression for the horizontal and vertical 
deflections of its tip due to a vertical load P. 

Because of the slender nature of the curved beam, direct stress and transverse 
shear strain energies may be neglected compared to bending strain energy. 

To determine the horizontal deflection, a horizontal dummy load, Q, is added 
at the tip of the beam as shown in Figure 3.123(b). 

Bending moment expression: 

As also shown in Figure 3.123(b), to determine the bending moment distribution, a 
section X-X is taken at position B, angle cp from the horizontal, and equilibrium of a 
section AB of the beam is considered. Taking moments about X-X, 


Note that the bending moment is a function of both the applied load, P, and the dummy 
load, Q.Also, the sign of the bending moment is unimportant as it is squared in the strain 
energy integral (see below). 

Strain energy: 

The strain energy expression is now 


Note that ds = Rdtp and that the limits of integration are 0 to tt/2. 

Vertical displacement: 

The vertical displacement is now given by differentiating equation (3.88) with respect to 
the applied load P, as follows, 


Figure 3.123 Curved cantilever beam with point load applied at 
the free end 


1 


j 



M + PR cos c|) + QR{ 1 - sine))) = 0 
M = PR cos <(> - QR( 1 - sin4>) 


(3.87) 



(3.88) 



► 
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To simplify matters, the dummy load, Q, can be si 
performed. We then have, 


o zero before the integration is 


pR ? r ' 2 2 , 

^ = ^ri o cos *** 

^(1 + cos2(f))d4> 


_ ttPR 3 
4EJ 

Horizontal displacement: 

The horizontal displacement is now given by differentiating equation (3.88) with respect 
to the dummy load, Q, as follows, 


du i r” /2 

= 3Q = WiL 2[— PRcos 4> - 


a*)] ■ [-R(l - 


n 4*)] ‘ R d<|) 


Again, setting the dummy load, Q, equal to zero before integrating we have, 

_ PR 3 r 


prr/2 

YJ o cos 4>(1 — sin4>)]d<() 
PR 3 r' 2 A 1 ' OJLJA 

= ~eT Jo coset) --Sin 24) dct) 

PR 3 \ ■ , , 1 0 ,T /2 

’ ~eT [ sm< t) + 4 c°s24)J , 


_ 

- 2B/ 

Note that the vertical deflection is tt/ 2 X horizontal deflection. 


Combined strain energy 

A circular cross section, offset cantilever, ABC, as shown in Figui 
loaded at its free end by a load, P. 



Figure 3.124 Offset cantilever beam with a point load applied 
at the free end 

Neglecting strain energy due to bending shear, derive an expression for the 
vertical deflection at the load point. 
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In this example, section AB of the cantilever stores bending strain energy only, 
while section BC stores both bending and torsional strain energy. 

Strain energy in AB: 

Referring to Figure 3.124(b), which shows a section, X-X, taken distance x from A, the 
bending moment is obtained by considering equilibrium: 

M+ Px = 0 M = -Px 

The strain energy, l/ Ar) , is given by, 


i-fi i d * 

-cm* 

P 2 L f 
6 El 


(note the limits 0 to for the length) 

Strain energy in BC: 

Referring to Figure 3.124(c), which shows a section, X-X, in BC, taken distance x from 
B, the bending moment and torque are obtained by considering equilibrium: 

M+ Px = 0 M = - Px 

T + PL X = 0 T= -PL X 

The strain energy, U nc , is now given by 




■I 


^P 2 x 2 , 




D P 2 Lfl 
2 GJ C 


So, the total strain energy in the cantilever is 

\Ll+Ll 


u=u AB + u BC = P 2 \- 


6EI 


Lj L, 

" 2 GJ \ 


Deflection of the tip: 

The deflection of the tip is now given by differentiating equation (3.89) with respect t< 
the applied load, P, as follows, 

[ Lfl + L 2 3 L x 2 L 2 1 
6EI 2 GJ \ 


v = mf = 2 P ■ 


Learning summary 


By the end of this section you should have learnt: 

/ the basic concept of strain energy stored in an elastic body under loading; 

/ to calculate strain energy in a body/structure arising from various types of loading, including 
tension/compression, bending and torsion; 

/ Castigliano’s theorem for linear elastic bodies, which enables the deflection or rotation of a body at 
a point to be calculated from strain energy expressions. 
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3.11 Fatigue 

Introduction 

Fatigue failure of components and structures results from cyclic or repeated loading and from 
the associated cyclic stresses and strains, as opposed to failure due to monotonic or static stresses 
or strains, such as buckling or plastic collapse due to excessive plastic deformation. The topic of 
fatigue is extremely important in mechanical engineering, since machines have moving parts, 
which in turn give rise to stresses and strains that may vary with time, typically in a repetitive 
fashion. For example, the axle of a car which will transmit a time-varying torque, that changes 
from zero to some finite value when the car is put into gear and driven (and back to zero again 
when the car is taken out of gear). 

An important design consideration, with respect to fatigue, is the fact that fatigue failure can 
occur at stresses that are well below the ultimate tensile strength of the material and often 
below the yield strength (or 0.2% proof stress). 

Basic phenomena 

The failure mechanism for an initially uncracked component with a smooth (polished) surface 
can be split into three parts, namely crack initiation, crack propagation and final fracture, as 
follows: 

Stage I crack initiation: The microstructural phenomenon 
that causes the initiation of a fatigue crack is the development 
of persistent slip bands at the surfaces of the specimen. These 
persistent slip bands are the result of dislocations moving 
along crystallographic planes, leading to both slip band 
intrusions and extrusions on the surface. These act as excellent 
stress concentrations and can thus lead to crack initiation. 

Crystallographic slip is primarily controlled by shear stresses 
rather than normal stresses so that fatigue cracks initially tend 
to grow in a plane of maximum shear stress. This stage leads to 
short cracks, usually only of the order of a few grains. 

Stage II crack propagation: As cycling continues, the 
fatigue cracks tend to coalesce and grow along planes of 
maximum tensile stress range. 

Final fracture: This occurs when the crack reaches a critical length and it results in either 
ductile tearing (plastic collapse) at one extreme, or cleavage (brittle fracture) at the other. 




subjected to cyclic stress 


Figure 3.126 Schematic of stages I and II transcrystalline 
microscopic fatigue crack growth 
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Fatigue life analysis 

In order to allow for fatigue in the analysis and design of components, a number of different 
approaches are adopted, two of which are described here. The more traditional approach is 
what is now referred to as the total life approach, based on laboratory tests, which are carried 
out under either stress- or strain-controlled loading conditions on idealized specimens. These 
tests furnish the number of loading cycles to the initiation of a ‘measurable’ crack as a function 
of applied stress or strain parameters. The ‘measurability’ is dictated by the resolution accuracy 
of the crack detection method employed. A typical ‘measurable’ crack is about 0.75 mm to 
1 mm. The challenge of fatigue design is then to relate these test results to actual component 
lives under real loading conditions. 

The second approach is known as the damage tolerant approach. This approach is based on 
the inclusion of fatigue as a crack growth process, taking account of the fact that all components 
have inherent flaws or cracks. The development of fracture mechanics techniques to predict 
crack growth has facilitated this approach as a competing technique to the total life approach. 
Both of the approaches have advantages and disadvantages; the former has more appeal to design 
engineers while the latter is more often used by material scientists and researchers. Nonetheless, 
even in routine design, the damage tolerant approach is gaining popularity. 


Total life approach 

The total life approach is based on the results of stress- and 
strain-controlled cyclic testing of laboratory test specimens of 
material, in order to obtain the numbers of cycles to failure as 
a function of the applied alternating stress, for example. Figure 
3.127 shows a rotating bending test machine set-up. This is a 
constant load amplitude machine since the load doesn’t change 
even with crack growth. The specimens usually have finely 
polished surfaces to minimize surface roughness effects, which 
would particularly affect stage I growth. In this approach, no 
distinction is made between crack initiation and propagation. 

Stress concentration effects can be studied by machining in 
grooves, notches or holes. 

Traditionally, most fatigue testing was based on fully reversed (i.e. zero mean stress, S m = 0), 
stress-controlled conditions, and the fatigue design data was presented in the form of S-N 
curves, which are either semilog or log-log plots of alternating stress, S a , against the measured 
number of cycles to failure, N, where failure is defined as fracture. Some of the important stress 
parameters for cyclic loading are shown in Figure 3.128. 


n / i r 

~T 



Q=m 


IX 

Motor 


Figure 3.127 Rotating bending moment test 
apparatus for fully reversed fatigue loading 
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Figure 3.129 contains schematic representations of two typical S-N curves obtained under 
load- or stress-controlled tests on smooth specimens. Figure 3.129(a) shows a continuously 
sloping curve while Figure 3.129(b) shows a discontinuity or ‘knee’ in the curve. A knee is 
only found in a few materials (notably low-strength steels) between 10 6 and 10 7 cycles under 
non-corrosive conditions. The curves are normally drawn through the median life value 
(of the scatter in N) and thus represent 50 per cent expected failure. The fatigue life, N, is 
the number of cycles of stress or strain range of a specified character that a given specimen 
sustains before failure of a specified nature occurs. Fatigue strength is a hypothetical value of 
stress range at failure for exactly N cycles as obtained from an S-N curve. The fatigue limit 
(sometimes called the endurance limit) is the limiting value of the median fatigue strength as 
Nbecomes very large, e.g. >10 8 cycles. 




Effect of mean stress 

The alternating stress, S a , and the mean stress, S m , are 
defined in Figure 3.128. Early investigators of fatigue 
assumed that only the alternating stress affected the fatigue 
life of a cyclically loaded component. However, it has 
since been established that the mean stress has a significant 
effect on fatigue behaviour, as shown in Figure 3.130. It 
can be seen that tensile mean stresses are detrimental while 
compressive mean stresses are beneficial. 

The effect of mean stress is commonly represented as a plot 
of S a versus S m for a given fatigue life. Attempts have been 
made to develop this relationship into general relations. 
Three of these common relations between allowable 
alternating stress for a given life as a function of mean stress 
are shown in Figure 3. 131. The modified Goodman line 
assumes a linear relationship between the allowable S a and 
the corresponding mean stress S m , where the slope and 
intercepts are defined by the fatigue life, S e , and the material 
ultimate tensile strength (UTS), S„, respectively. The Gerber 
parabola employs the same end-points but, in this case, the 
relation is defined by a parabola. Finally, the Soderberg line 
again assumes a linear relation, but this time the mean stress 
axis end-point is taken as the yield stress, S r The modified 
Goodman line, for example, can be extended into the 
compressive mean stress region to give increasing allowable 
alternating stress with increasing compressive mean stress, 
but this is normally taken to be horizontal for design 
purposes and for conservatism. 


I 


10 3 10 4 10 5 1 0 6 1 0 7 1 0 8 

N 

Figure 3.130 The effect of mean stress on fatigue life 
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Effect of stress concentrations 


Ever since the first occurrences of fatigue failure, it has been recognized that such failures are 
most commonly associated with notch-type features in components. It is impossible to avoid 
notches in engineering structures, although the effects of such notches can be reduced through 
appropriate design. The stress concentration associated with notch-type features leads typically 
to local plastic strain, which eventually leads to fatigue cracking. Consequently, the estimation 
of stress concentration factors associated with various types of notches and geometrical 
discontinuities has received a lot of attention. This is typically expressed in terms of an elastic 
stress concentration factor (SCF), K t , which is simply the relationship between the maximum 
local stress and an appropriate nominal stress: 



It was once thought that the fatigue strength of a notched component could be predicted as 
the strength of a smooth component divided by the SCF. However, this is not the case. The 
reduction is, in fact, often less than K t and is defined by the fatigue notch factor, Kj, which is 
defined as the ratio of the smooth fatigue strength to the notched fatigue strength: 



However, this fatigue notch factor is also found to 
vary with both alternating and mean stress level and 
thus with number of cycles to failure. Figure 3.132 
shows the effect of a notch, with an SCF of 3.4, 
on the fatigue behaviour of a wrought aluminium g 

alloy, where the smooth lines are for the smooth § 

specimen and the dotted lines are for the notched 
specimen. The table shows how the fatigue notch 
factor changes with mean stress level and fatigue 
life. Clearly, the fatigue notch factor increases from 
3.2 to 5.7 from 10 4 cycles to 10 7 cycles at 172 MPa 
mean stress, but remains unchanged between these 
lives at 2.3 for zero mean stress. 


Mean stress 

1 0 4 cycles 

1 0 7 cycles 1 

OMPa 

51/22 = 2.3 

22/9 = 2.3 

172 MPa 

42/13 = 3.2 

17/3 = 5.7 



Figure 3.132 Constant life diagrams for a wrought aluminium 
alloy for both smooth and notched specimens 


S-N design procedure for fatigue 

Constant life diagrams plotted as S a versus S m , also called Goodman diagrams, c 
design to give safe estimates of fatigue life and loads. 

/ = = factor of safety 

fSa 


similar triangles - ^ 


kjS a 

= 
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(1) The Goodman line connects the endurance limit, S e (or long-life fatigue strength), to the 
UTS, S u . 

(2) The fatigue strength for zero mean stress is reduced by the fatigue notch factor, Ky-.The 
stress concentration factor, K t is used if Kj is not known. 

(3) For static loading of a ductile component with a stress concentration, failure still occurs 
when the mean stress is equal to the UTS. Failure at intermediate values of mean stress is 
assumed to be given by the dotted line. 

(4) In order to avoid yield of the whole cross section of the component, the maximum 
nominal stress must be less than the yield stress, S y 

S m + S a < S y 

This relationship gives the yield line joining S' to S' . 

(5) The region of the diagram nearest to the origin is the ‘safe’ region (can also be extended to 
include compressive yield). 

(6) A component is assessed by plotting the point corresponding to the nominal alternating 
stress, S a , and the nominal mean stress, S m , i.e. not the maximum values associated with the 
notch. The factor of safety is determined from the position of the point relative to the safe/ 
fail boundary, i.e. factor of safety F = OB/OA. 

From similar triangles: 


S a S e 



i = ^f s* 

F S e S u 


A procedure similar to that described above for long life can also be used to design for a 
specified number of cycles. In this case the endurance limit and the fatigue notch factor are 
replaced by the fatigue strength and the fatigue notch factor for the specified number of cycles. 


Learning summary 

By the end of this section you should have learnt: 

/ the various stages leading to fatigue failure; 

/ the basis of the total life and of the damage tolerant approaches to estimating the number of 
cycles to failure; 

/ how to include the effects of mean and alternating stress on cycles to failure using the Gerber, 
modified Goodman and Soderberg methods; 

/ how to include the effect of a stress concentration on fatigue life; 

/ the S-N design procedure for fatigue life. 


3.12 Fracture mechanics 


Linear elastic fracture mechanics (LEFM) 

Consider the stress concentration factor for an elliptical hole in a large, linear-elastic plate 
subjected to a remote, uniaxial stress. 

It can be shown that the stress concentration factor is as follows: 


K' = t;= 1+^r 
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Figure 3.134 Elliptical hole in an infinite plate 


Thus, as b — * 0, the elliptical hole degenerates to a crack, and ^ — > oo, so that the notch stress 

also goes to infinity (i.e. becomes singular), — = oo, provided that the material behaviour 
remains linear elastic. 

The root radius for an ellipse is given by 

p = ~- b = Jap 


so that 



and again, as the notch tip radius goes to zero, i.e. p — > 0, the notch tip stress again goes to 
infinity: 


cr 

CT 


2 /f- 


The singular (infinite) state of stress at a crack tip is one of the fundamental and most 
important aspects of fracture mechanics. 


Basis of the energy approach to fracture mechanics 

Griffith (1921) studied the brittle fracture of glass and adopted an energy approach to solving 
the problem. He reasoned that unstable crack propagation occurs only if an increment of crack 
growth, da, results in more strain energy being released than is absorbed by the creation of the 
new crack surfaces. This can be re-expressed as the change in strain energy dU, due to crack 
extension, being greater than the energy absorbed by the creation of the new crack surfaces. 
Thus, if we designate the surface energy per unit area of the crack y s , then the surface energy 
associated with a crack of length 2 a in a body of thickness B (as shown in the Figure 3.135) is 
given by: 

W s = 4 aBy s 

Detailed stress analysis of an elliptical hole in an infinite elastic plate has established that the 
strain energy in such a body is 


where cr is the remote stress (away from the hole) and where, for plane strain and plane stress, 
respectively, 
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The total system energy is thus 


According to Griffith, the critical condition for the onset of crack growth i: 

dU _ Tt aa 2 

d A ' 

Therefore: 


- + 2y s = 0 


t t t t t t t° 



! I I ! I ! I 

Figure 3.135 Crack in an infinite plate 


where A = 2aB is the crack area and dA denotes an incremental increase in crack area. 

The total surface area of the two crack surfaces is 2 A This relationship is conventionally 
re-expressed as 

G = G c 

where G is called the strain energy release rate, the crack tip driving force or the crack 
extension force. G c is a material property, which is known as the critical strain energy release 
rate, the toughness or the critical crack extension force. A high value of G c means that it is 
difficult to cause unstable crack growth in the material whereas a low value means that it is 
easy to make a crack grow unstably. Thus, copper, for example, has a value of G c ~ 10 6 J/m 2 , 
whereas glass has a value of G c ~ 10 J/m 2 . The following relationships for plane stress and plane 
strain, respectively, follow from the above: 

G = 77 ^ (plane stress) 

(1 - v 2 ) 

G = p ttckj- (plane strain) 


Note that plane stress and plane strain are two contrasting two-dimensional assumptions that 
permit simplification of three-dimensional problems to two-dimensional ones. Plane stress 
corresponds physically to thin plate type situations while plane strain corresponds to thick plate 
type situations. Plane strain testing of fracture leads to lower values of G c , so that the material 
property value of G c for design purposes is taken as the plane strain value and is designated as 
Q, 

The critical stress that causes a crack to propagate in an unstable fashion, giving fracture, is 
governed by the following relationships 

oV'Tta = \]EG C (plane stress) 


i (plane strain) 


Since the terms on the right-hand side of these equations are material constants, and since the 
term on the left-hand side is so common, it is usually abbreviated to the symbol, K, which is 
referred to as the stress intensity factor, and the equations can be re-expressed as: 

K = K c 

where K c is called the critical stress intensity factor or the fracture toughness. Thus: 

K c = \[EG C 

In summary, K = oVt fa is called the stress intensity factor 
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K c is called the fracture toughness or critical stress intensity factor 
G c is called the toughness or the critical strain energy release rate. 

Note: Most materials are not linear elastic up to failure. However, the energy 
approach can still be used if the plastic strain is restricted to a region 
very close to the crack tip; this is referred to as small-scale yielding. 

Under these conditions, the energy release rate can still be reasonably 
accurately based on a linear elastic analysis. Also, G c , or G [c , now 
includes a component associated with plastic deformation of the 
crack tip as well as the creation of the surfaces. So far, we have 
only considered the so-called mode I loading case. There 
are actually three different loading modes considered in 
fracture mechanics. 

In general, the energy release rate under mixed-mode Mode !_ opening mode 
loading is given by Mode n _ shearing mode 

G = G[ + G n + Gjn Mode III— tearing mode 

Figure 3.136 Crack tip 

Elastic crack tip stress fields 

Westergaard (1939) established the following equations for the 
elastic stress field in the vicinity of a crack tip: 

= 7fe cos (i) I 1 ' sin (f) sin (f )1 + non - sin g ularterms 

*i /e\ L . . m . ( 30\i , . . 

< j y = ^ — cos ^ 2 / [1 + Sln \2/ Sm \"2 _ / J + non-singular terms 

K: . (Q\ (Q\ 1 30 \ , . , 

T xy = ^ — sm \2/ COS \2/ COS \~2~/ + non-smgular terms 

t zx = T zy = 0; cj z = 0 (plane stress) 
a , = v(a x + (T y ) (plane strain) 


1 ,, 
K 


Figure 3.137 Crack tip stress fields 
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Ki is the mode-I stress-intensity factor (units N/m 3/2 ) which defines the magnitude of the 
elastic stress field in the vicinity of the crack tip. Similar expressions exist, in terms of K n and 
fC H |, for the modes II and III loading situations. For mixed-mode loading, the stress fields can 
be added together directly. It can be seen that K h K n and K w characterize the entire stress 
field (and hence the strain field) in the vicinity of the crack tip. It therefore seems reasonable 
to assume that, for mode I loading, for example, failure will occur when K\ reaches a critical 
value K c (K lc under plane strain conditions). 

The energy approach and the stress intensity approach are equivalent. Generally, for plane stress 

Gtotai = G, + G„ + Qu = |(Ki 2 + K n 2 + K m 2 ) 

(1 - v 2 ) 1 

and for plane strain ^ is replaced by 

Generally, for geometries with finite boundaries, the following expression is employed for stress 
intensity factor 

K, = YaVm 


and similarly for K u and K m , where Y is a function of the crack and component dimensions. 


Material 

a y (Mi\7m 2 ) 

KfMNl m 3/2 ) 

mild steel 

220 

140-200 

pressure vessel steel (HY130) 

1700 

170 

aluminium alloys 

100-600 

45-23 

cast iron 

200-1000 

20-6 


Table 3.3 Typical fracture toughness values 


Solutions for Y can be found in the literature for a wide range of geometries and loadings, e.g. 

• H.Tada, P. Paris and G. Irwin, 1973, The stress analysis of cracks handbook, Hellertown, 
Pennsylvania: DEL Research Corporation. 

• G.P. Rooke and D.J. Cartwright, 1975, Compendium of stress intensity factors, HMSO. 

• Y Murakani (Editor), 1987, Stress-intensity factors handbook, Oxford: Pergamon Press, 

(2 volumes). 

The effects of finite boundaries on expressions for stress intensity factors can be seen from the 
typical expressions shown below. 


2 a 

IT 

V'Tnd (sec W J 
K'n — fyTf d ^ small 

2a 

mio 




tt t r 

- — IT— 

tttt; 


Ki = 1.12oVTta ^ small 
or K { = YaYa 

with V - 1.99 - 0.41 al + 18,7 (jyf - 38.48 (jyf 

+ 53 ' 85 [wf 

(1.99 = 1.12/tt) 
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t t t 

- — W — >S 

TTT 

r 

T| 1 ^~T§ 


= 1.12a%/TTa ^ small 
or K { = Ya-/a 

with Y = 1.99 - 0.76-^ - 8.48 (-£ f + 27.36 
(1.99 = 1.12/tt) 


-«(#r 


— ti) 





Thickness B 

K - P 

~ BW 1/2 


p per unit thickness 
Kj = pVWa 
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At fracture, with 


and 


Then, 


Therefore, 


2 2x1450 MN 

= = ““ 3 ~ ' IF 




= 1 / 72x3 \ 2 

tt 1 1.2x2x1450/ 1 

= 1.226 X 10“ 3 m 
= 1.226 mm 


If the material was mild steel, with 


a c rit = 0.451 m = 451 mm 
i.e. it is much more likely to be detected during inspection! 


Fatigue crack growth 

It has been shown by Paris et al. (1961) that, for a wide 
range of conditions, there is a log-log relationship 
between crack growth rate and the stress intensity factor 
range during cyclic loading of cracked components. This 
has become the basis of the damage-tolerant approach to 
fatigue life estimation and is widely used both in industry 
and in research. Essentially, it means that crack growth 
can be modelled and estimated based on knowledge 
of crack and component geometry, loading conditions 
and using experimentally measured crack growth data 
to furnish material constants. This section describes the 
basics of this approach. 

Considering a load cycle as shown above that gives rise to a load range 

A P = P max - P nnn (3.90) 

acting on a cracked body. The load range and crack geometry give rise to a cyclic variation in 
stress intensity factor, which is given by 

A K=K ma ^-K miB (3.91) 

Even though the stress intensity factor may be less than the critical stress intensity factor 
for unstable crack growth, stable crack growth may occur if the stress intensity factor range, 


Pi 



Figure 3.139 Variation of P (load) 
with t (time) 
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A K, is greater than an empirically determined material property called the threshold stress 
intensity factor range, designated A K th . In addition, Paris showed that the subsequent crack 
growth can be represented by an empirical relationship as follows: 

= C(AX) m (3.92) 

where C and m are empirically determined material constants. This 
relationship is known as the Paris equation. Fatigue crack growth 
data is often plotted as the logarithm of crack growth per load cycle, 
da/ dN, and the logarithm of stress intensity factor range. There are 
three stages. Below A K th , no observable crack growth occurs; region 
II shows an essentially linear relationship between log(da/ dN) and 
log(AK), where m is the slope of the curve and C is the vertical axis 
intercept; in region III, rapid crack growth occurs and litde life is 
involved. Region III is primarily controlled by K c or K k . 

The linear regime (region II) is the region which engineering 
components that fail by fatigue propagation occupy for most 
of their life. Knowing the stress intensity factor expression for a 
given component and loading, the fatigue crack growth life of 
the component can be obtained by integrating the Paris equation 
between the limits of initial crack size and final crack size. 

For most materials, the constant C is found to be dependent on R, 
where R is a measure of the mean stress defined as 



Figure 3.140 Typical (schematic) variation of log 
(dfl/dN) with log (A K) 


R = 


K tm 


s shown in Figure 3.141. 


Material 

AK th (Mi\7m 3/2 ) 

m 

AK(MN/m 3/2 ) 

for da/dN = 10" 6 mm/cycle 

mild steel 

4-7 

3.3 

6.2 

31 6 stainless steel 

4-6 

3.1 

6.3 

aluminium 

1-2 

2.9 

2.9 

copper 

1-3 

3.9 

4.3 

brass 

2-4 

4.0 

4.3-66.3 

nickel 

4-8 

4.0 

8.8 



log(A K) 

Figure 3.141 Effect of R on fatigue 
crack growth 


Table 3.4 Typical values of A k th , m and A k (From Pook (1 978), pp 1 1 4-35.) 

The A K th and A K (for da/dN = 10 -6 mm/ cycle) values depend on the R-value. 


Learning summary ^ 

By the end of this section you should have learnt: 

/ the meaning of linear elastic fracture mechanics (LEFM); 

/ what the three crack tip loading modes are; 

/ the energy and stress intensity factor (Westergaard crack tip stress field) approaches to LEFM; 

/ the meaning of small-scale yielding and fracture toughness; 

/ the Paris equation for fatigue crack growth and the effects of the mean and alternating 
components of the stress intensity factor. 
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3.13 Thermal stresses 

Introduction 

Changes of temperature in a body cause expansion/contraction.This phenomenon is 
quantified by the coefficient of thermal expansion a. In any direction, the change of length 
due to a temperature change, T, is IolT where l is the original length. Therefore, direct thermal 
strain is given by: 


For isotropic materials, a is the same for all directions and there are no thermal shear strains. 
Uniform termperature change throughout an unrestrained body produces uniform strain but 
no stress, i.e. free expansion/ contraction in all directions. 

Generalized Hooke’s law 

s x = -g (<j x — v(cr y + ct 2 )) + a T 
Gy = — v(a x + a 2 )) + a T 

e, = ~ v(cr x + (j y )) + a T 

7 xy = T xy /G 
lyz = T y2 /S 
Izx = T 2X /G 

where T is the temperature at a point relative to some datum. 

By introducing these thermal strains into the generalized Hooke s law we can obtain solutions 
to thermal stress problems, which are often very important in, for example, power and chemical 
plant, aeroengines and internal combustion engines (e.g. pistons and cylinder walls). 

Case 1 : An initially straight uniform beam 



Figure 3.142 

Determine deformations and stresses (small deformations) 

The temperature is (assumed) purely a function of y, i.e. T = T(y). 

The coefficient of thermal expansion = a. Axial force P, and pure bending, about the z-z 
axis ,M, are also applied. 
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Assuming cr z , cr y , t xz and T yz = 0 because the cross-sectional dimensions are small compared 
with the length. 

Also 

T xy = 0, because M does not vary with x, .'. S = — = 0 
Compatibility 

Remote from the ends, strain varies linearly with y, 



Where s is the mean strain (at y = 0) and R is the radius of curvature. 

Stress-strain CT 

s x = ^ + «T{=e + -\ 

(T x = E X {s + - ar} (3.93) 

Axial equilibrium 

P = f a x dA = eJ {s + - aT}cL4 = EsA + -| J ydA - EaJ TdA 

J ydA = 0 

because y is measured from an axis passing through the centroid. 

P = EsA - EaJ TcL4 (3.94) 

Moment equilibrium 

M = J ya x dA = eJ {e + - «T}ycL4 = Esf ydA + y 2 dA ~ EaJ 7ycL4 

Now 

J y 2 dA = I 

(second moment of area, by definition) 

M=~- EaJ TydA (3.95) 


Worked example 

A rectangular beam, width b and depth d has a temperature variation given by: 
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There is no restraint or applied loading (i.e. P = M= 0). Obtain the stress distribution. 
Axial force equilibrium (equation (3.94)) 

P = EsA - EaJ TdA 

4/1 


[d/2 4y 2\ 

0 = Ee-bd - EaJ-jT^l - -j. T J(bdy ) 

»= 7 t *£ 2 {*-$* 


i T [ _V1‘ 

d T °l y 3d 2 ]- 
s = foi-T, 


_ _ 2 

' 3 


With M= O ' w 
from symmetry 

Using equation (3.93) 


At y = O, 


obtain 1/R from the moment equilibrium (equation (3.95)) but 
n see that (1/R) = O. 


- e(|(*T„ + O - uT,{ 1 - 
= EolT 0 [ 


U/ 


—EaT 0 
2 3 





Worked example 

A rectangular beam (again b X d), but with: 

2y 

t=t 0 x y 

It is constrained so that s = 0 and 1/R = 0. 
Determine the stresses and restraints. 
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dyf 

Figure 3.145 

Axial force equilibrium (equation (3.94)) 

P= EeA - Ea I TdA 


a L 

f 2T 0 b t*n 2T 0 b\y 2 ] d/2 

L TdA = ^~J- d/ / dy = ^m = 0 


s = 0 

Moment equilibrium (equation (3.95)) 


M = ^ - EaJ TydA 
f 2T 0 b M/2 2T 0 b\y 3 Y 

L ^=— J _ rf/2 y 2 d ^=— [y| 




Also, 


Eabd 2 T 0 

~~6 


1/R = 0, .-. M = 

Using equation (3.93) (with s = (1/R = 0)) 

v x = (® + R ~ aT ) = ~ Ea ' T 

2 y 

(T x = —EaT 0 -j- 
i.e. 

-2EolT 0 
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Case 2: A thin disc of uniform thickness 

The equilibrium and compatibility equations are as for rotating discs, with w = 0, i.e. 
Equilibrium { raaial ) 



dcr r 

(T r + r~^r - o-e = 0 

(3.96) 

Compatibility 


(3.97) 


S' 

Til'S 

‘I. 

(3.98) 

i.e. 

II 

O' lo- 
ll 

+ 

(3.99) 

However, the stress— strain equations now contain thermal terms. 


i.e. 

£0 = - vo>) + a T 

(3.100) 


s r = -g(c T f — vct 9 ) + a T 

(3.101) 


Note: Because the disc is thin, a,. = 0. 
Substituting (3.100) and (3.101) into (3.99) gives 


“ vae) + aT ) = (b (CT0 “ + olT ) + e[^~ 

d(To da. at 
(1 + v)(a e - a f ) + r-rr - vr~rr + Ear^~ = 


d<7 r \ 

V ~drl 


d T 


' dr 


ct 9 — (T r = r~ t 1 from (3.96) 


da. d(j (l 

(1 + v)r^r + r^r 


da r 

"“dT H 


Also from (3.96): 


r dr\ 2ov + r ~dr r ) = ~ Ear ~dr 
I da. rd 2 (i r dcr,\ d 

A 2 ^ + ~dF + ^) = ~ EcLr ~d 

o d2 °r , d(T r „ d T 
r -dr 2 + 3r ~dr" = - Ear ~dr 


a = A - ^ + P.I. 


do> 

a 0 = ar + r~^ 
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Worked example 

A steel disc with 300 mm outside 
diameter, 40 mm bore diameter and 
5 mm thickness is subjected to a 
temperature distribution 
of the following form: 



where r is the radius in mm. Find the 
hoop stress at the bore. 



The governing differential equation is: 

d 2 a r da, 

r-j-T - + 3 r-j— = — Ecl, 
dr dr 


P.I. = Cr 2 (say) 

r 2 ■ 2C + 3r - 2Cr = -400Ba^2 

_ -50 Ecl 
150 2 



Boundary conditions: 
At r = 20 mm, a, = 0 


0 = A - 7^2 - 50Eaj 


At r= 150mm, a, = 0 


Subtracting (3.103) from (3.102) gives: 

n _ J 1 1 


+ 50Ea 1-Tsa 


B = 20 OOOEa 


Substituting B in (3.103) gives: 


1 + 50Ea = 50.88Ea 


, , _ B _ _r^_ , (2B_B 

'' + r d r ~ A r 2 50Ea 150 2 + r l r 3 150 2 J 


a 9 = A + ^ - 150Eaj 
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Case 3: Thin cylinders 

Thin cylinders are in common use in power and chemical plant, e.g. pipes, pressure vessels, 
etc. Often temperature variations are approximately linear through the thickness. Considering 
positions remote from ends, flanges, etc. 



It is convenient to consider the effect of the uniform temperature change and the temperature 
gradient separately. If the cylinder is not restrained, then the uniform temperature change 
causes overall dimensional changes, but no stress. The stresses due to axial restraint are easily 
calculated. 

For the temperature gradient we have: 

T= A T~ t 

where AT is the temperature difference across the wall. 
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For a thin cylinder: 



and 



Remote from the ends of the cylinder sections remain plane and circular. Therefore, from 
compatibility considerations (with zero mean temperature change), the hoop and axial strains 
must both be zero. Therefore: 


Learning summary 

By the end of this section you should be able to: 

/ understand the cause of thermal strains and how ‘thermal stresses’ are caused by thermal strains; 

/ include thermal strains in the generalized Hooke’s Law equations; 

/ include the temperature distribution within a solid component (e.g. a beam, a disc or a tube) in the 
solution procedure for the stress distribution; 

/ understand that stress/strain equations include thermal strain terms but the equilibrium and 
compatibility equations are the same whether the component is subjected to thermal loading or 
not. 
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0 = ^(cr.e - vd z ) + a(A T)~ t 


and 


0=|(cr 2 -va e ) +a(AT)^ 


Solving, gives 
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Electromechanical drive 
systems 


UNIT OVERVIEW 

■ Introduction 

■ Characteristics of loads 

■ Linear and rotary inertia 

■ Geared systems 

■ Tangentially driven loads 

■ Steady-state characteristics of loads 

■ Modifying steady-state characteristics of a load using a transmission 

■ Sources of mechanical power and their characteristics 

■ Direct current motors and their characteristics 

■ Rectified supplies for dc motors 

■ Inverter-fed induction motors and their characteristics 

■ Other sources of power: pneumatics and hydraulics 

■ Steady-state operating points and matching of loads to power sources 


4.1 Introduction 

In many engineering situations it is necessary to design a system that involves a source of 
mechanical power to drive (directly or indirecdy) a mechanical load of some kind, with one 
or more rotating shafts often being used within the means of transmitting the power from 
source to load. Such systems can range from the very small (for instance, focusing mechanisms 
within digital cameras) to the extremely large (e.g. driving the propellers of a transatlantic 
liner). In all cases, it is essential to understand the characteristics of the load we are trying to 
drive, the characteristics of the source of mechanical power that we are using to drive it, and 
the need for (and function of) any transmission mechanism needed to match the characteristics 
of the power source and load. This unit will therefore examine in depth the steady-state and 
transient characteristics of loads and of typical sources of mechanical power used to drive them, 
including electric motors and other forms of motors and engines. It is also useful to be able to 
alter the characteristics of the power source in order to provide control over the way that the 
overall system behaves, so the techniques by which electric motors can be controlled will be 
explored in some depth. The manner in which the power source and load interact is examined, 
with particular reference to how apparently incompatible characteristics can be matched, in 
order to achieve stable operation in the desired manner. Although this unit draws together 
topics from the disciplines of dynamics, drive systems, electrical engineering, fluid power and 
mechanical design, various texts are available that follow this kind of approach in greater depth, 
for example the text by Fraser and Milne (1994). 
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4.2 Characteristics of loads 

As hinted in the introduction, in order to arrive at an optimal decision on how to drive 

a machine, it is necessary to have a full understanding of how that machine behaves as a 

mechanical load. Specifically, the following questions need to be addressed: 

• How hard is it to get the machine moving from rest? (friction: ‘stiction’ and Coulomb 
friction) 

• How hard is it to speed it up? (inertia) 

• How much harder does it get to drive as we make it go faster? (torque-speed characteristics 
including viscous friction and windage) 

• How hard is it to make it stop when we want to? (inertia again, with some friction 
considerations) 

• How well do the characteristics of the machine match with those of the device we are using 
to drive it, and how can we overcome any mismatch? 

Some of these issues can be illustrated with reference to issues familiar to automobile drivers: 

• A car filled to its maximum design capacity with heavy people is much less responsive (for 
example, when pulling away from traffic lights) than when carrying only the driver. The 
inertia of the load has increased and is no longer very well matched to the power 
source. 

• A car driven at a steady 50 mph uses considerably less petrol to travel a given distance than 
one driven at the legal motorway limit of 70 mph. The steady-state torque required 
from the car engine is determined (in a large part) by wind drag which is 
roughly proportional to the square of the speed. (The drag force falls to something 
like (50/70)“ = 0.51 of its original value, though there are various other losses that show less 
of a reduction.) 

• It is possible to tow quite a heavy trailer with a car as long as the driver allows for 
the poorer acceleration. However, such a trailer must include its own braking system 
(various mechanisms exist) because it would otherwise be difficult to stop the car/ trailer 
combination. The inertia of the system is no longer well matched to the braking 
system (nor, for that matter to the power source). 

• When a steep hill is reached, the driver must change down a gear (and probably settle for 
driving at a lower speed if currently travelling fast). The engine power is limited, and 
in top gear the engine is badly matched to the increased load, so to obtain the 
required torque it is necessary to gear the engine down to obtain a high torque 
at low speed. 


4.3 Linear and rotary inertia 

The concept of inertia is based upon Newton’s second law: 

F=ma ( 4 . 1 ) 

where F is the force required to cause a mass m to undergo an acceleration of a. In this context, 
mass is a measure of how difficult it is to accelerate something — in other words, mass is a 
measure of linear inertia. 

In engineering, we are often concerned with rotational movements, and so we need to 
introduce the concept of rotational inertia, in other words how difficult it is to cause 
rotational acceleration in a body that is capable of rotation about an axis. This is called the 
moment of inertia, a concept which was covered in Section 6.7 of Part 1 and is used within 
Unit 6 of the present volume in the context of torsional vibration. It is briefly revised here for 
continuity. 
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Moment of inertia 


To avoid confusion with electric current I, the symbol J will he used here for moment of 
inertia of a body about its axis of rotation. Moment of inertia is analogous to mass in the 
rotational version of Newton’s second law: 

Torque = moment of inertia X angular acceleration 
i.e. L= Ja (4.2) 


For a body B rotating about a given axis OZ (Figure 4.1),J oz is calculated as follows: 

Jo z = Ifidm (4.3) 

By way of a specific example, a solid cylinder or a disc of mass m and radius r 
(Figure 4.2(a)) has a moment of inertia 

J = \mr 2 (4.4) 

about its axis of symmetry. 



Figure 4.1 A body with axis of 
rotation OZ 


o 





Figure 4.2 (a) Solid cylinder, (b) thin-walled tube, (c) concentrated mass at radius r from axis of rotation 


By contrast, a thin-walled tube or a thin circular ring of mass m and radius r (Figure 4.2(b)) 
has a moment of inertia 


J=mr 2 


(4.5) 


about its axis of symmetry. 

Finally, a concentrated mass m being swung at radius r about a given axis (Figure 4.2(c)) has 
a moment of inertia 


J = mr 2 (4.6) 

about that axis. An example of such a system would be a golf club with a heavy head and a 
light handle, pivoting around the golfer’s wrist. 

Inertias on the same axis are simply added together; objects with portions removed are treated 
by subtracting the inertias of the ‘missing’ portions from the inertia of the overall object. For 
example, the moment of inertia of a hollow cylinder of outer radius r Q and inner radius r, about 
its axis (Figure 4.3) is found by subtracting the moment of inertia of the ‘missing’ inner cylinder 
(of mass m ; ) from the moment of inertia of the solid cylinder (of mass m Q ): 

./hollow ./solid ./missing 


= ^m 2 pt)r 2 - |(irr i 2 p/)r i 2 

= f P^o 4 - h 4 ) 
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which can be expressed as: 

Jhoilow = f pl(r 0 4 - h 4 ) 

= f PKro 2 ~ rd(r 0 2 + fi 2 ) 

= F»hoiiow<>o 2 + h 2 ) (4.8) 

Superficially this appears to give a different answer from the formula relating to thin-walled 
cyhnders (Figure 4.2(b)), but for such a cylinder r 0 and r, tend towards being equal as the wall 
thickness tends to zero, and Jhoilow approaches the familiar value of mr 2 . 



Figure 4.3: Combining inertias 


Learning summary ^ 

By the end of this section you should have learnt: 

/ the similarities and differences between linear and rotational inertias and how they are analysed; 

/ the concept of moment of inertia; 

/ to calculate moment of inertia for simple components made up of cylinders and 
tubes. 

V / 


4.4 Geared systems 

In order to extend the above concepts to geared systems, the concept 
of a gear ratio must also he defined (Figure 4.4): 


. N 2 CO! 04 U 

Gear ratio n = ~rr = — = — = ~r~ 

U>2 a 2 fit 


(4.9) 


where Nj and N 2 are the number of teeth on the gears mounted 
on the input and output shafts respectively, is the angular 
velocity (in rad/s), a, is the angular acceleration (in rad/s 2 ) and 
L, is the torque (in Nm) for the input shaft, with corresponding 
terms (o 2 etc. referring to the output shaft. Note that equation 
(4.9) is also valid if the angular velocities are expressed 
consistently in other units e.g. rev/min or rev/s; this validity does 
not extend to equations that involve calculations of actual values 
of power. A similar argument applies to accelerations: calculations 
relating inertias to accelerations must be performed using rad/s 2 
as the unit of angular acceleration because equation (4.2) is not 
valid if units such as rev/s 2 are used. 



speeds etc 


250 


Electromechanical drive systems 


The angular accelerations experienced by the different shafts are related in just the same way 
as are the speeds (angular velocities). The last term in equation (4.9) comes from the law of 
conservation of energy for a perfectly efficient geared system: 
power out = power in 


Lo( jo 2 — 


so that input torque L, can be calculated from L 2 





(4.10) 

(4.11) 


The same theory relates to other toothed drive systems, specifically sprocket/ chain systems 
(such as those used in bicycles, motorcycles and some industrial situations) and toothed pulley/ 
belt systems, used for the camshaft drive in cars and within numerous motion control and 
positioning systems in industry and within more light-duty systems such as traversing the 
heads of inkjet printers and moving the scanning sensor on photocopiers. All of these may be 
regarded as synchronous drives in that the drive ratios are exactly determined by the numbers 
of teeth on the driving and driven components. One difference between gears and other drive 
systems such as chain drives and toothed belt drives is that gears reverse the direction of drive 
while belts and chains normally do not. 


Worked example 

A machine is driven at 35 rad/s with a torque of 30 Nm.The drive is provided 
via a transmission with a 4:1 reduction ratio drive. What is the combination 
of torque and speed required at the input to the transmission system? Assume 
that the transmission is perfectly efficient. 

For speed: 


I 


w 2 

where n = 4, w 2 = 35 rad/s 
oii = 4to 2 = 4 X 35 = 140 rad/s 


L2 

Lj = 30 Nm 


Broadly similar analysis can be performed for friction-based drive 
systems, though these are not synchronous because the components 
no longer possess teeth and some degree of slippage can take place 
between driving and driven components. The nominal drive ratio is 
now determined by the ratio of the effective radii (or diameters) of the 
components (Figure 4.5): 




coi _ oti 
(jl> 2 a 2 


(4.12) 


The most commonly encountered friction-based transmissions are 
vee belt/ pulley systems (used extremely widely, an everyday example 
being the drive from car engine to alternator), flat belt systems (less 
widely used today; examples include belt-drive turntables used for 
playing vinyl LP records), and systems based upon friction rollers 
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(widely used at one time in audio tape and cassette recorders, and still used within fairground 
and amusement rides, for example for ‘gearing’ rotating carriages to the rotation of the frame 
which carries them, or to provide the impetus to swinging ‘boats’ from a motorised wheel/ 
tyre running on the bottom of the boat). In the cases of vee belts, flat belts etc., care must he 
taken to understand the concept of the effective radius or pitch circle radius, since this may be 
significantly different from any of the measurable radii of the pulley (Figure 4.6). 



Figure 4.6 (a) Flat belt and (b) toothed belt showing effective radius 


Worked example 

A centrifugal blower needs to be driven at 3000rev/min from a motor that 
runs at 1460 rev/ min. If the effective diameter of the pulley on the blower is 
100 mm, what should the effective diameter of the pulley on the motor be? 

d 2 

d\ m 2 

d 2 = 100 mm, w, = 1460 rev/min, u> 2 = 3000 rev/min 
a> 2 3000 

d x = ~^d 2 = (ggi x 100 = 205.5 mm 


Drive ratio 
where 


An interesting variant of the friction-based drive is the variable-ratio drive. Various mechanisms are 
available, including systems based on vee pulleys of variable width (and hence of variable effective 
radius), and tilting spheres contacting conical rollers (Figure 4.7) so that the spheres behave as rollers 
with varying effective radii. Hydraulic variable-ratio drives also exist — see Section 4.12. 



Figure 4.7 Sphere-based variable speed drive (a) set to increase speed; (b) set to decrease speed; 
(c) effective radii of contact of spheres against the two cones 


Compound drive trains 

In many engineering situations it is not practical to obtain the required 
drive ratio via a single stage of transmission. In this case, multiple stages 
are used, with the overall drive ratio being the product of the drive 
ratios of each stage in the transmission system. 

For example, the drive ratio of the gear train shown in Figure 4.8 is: 

N 2 Na N 6 (Di oq 
Drive ratio n = X — X — = — = — 



N 3 N 5 w 6 a 6 


Figure 4.8 A compound drive train 
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Efficiency of geared systems 


While real gears, toothed belts, chains etc. scale kinematic quantities (rotational angle, angular velocity, 
angular acceleration) according to the drive ratio, frictional forces and torques result in inefficiencies, so 
the ratios of torques do not in fact precisely follow the relationships given in equations (4.9) and (4.10). 
These equations may therefore be rewritten to take account of the efficiency T| of the drive train: 
power out = efficiency X power in 

i.e. T2W2 = T l^i a) i (4-14) 


so that input torque L x can be calculated from output torque L,: 
I ^ 2 (o 2 L 2 N\ L 2 

1 TIO)! T ]N 2 "(\n 


(4.15) 


Efficiency is always less than unity, so input power is always greater than output power, with the 
difference being lost as heat. Typical efficiencies for geared systems may range from values close to unity 
(e.g. 0.95) for well-designed single-stage gear systems, down to much lower values (less than 0.5) for 
very high-ratio systems such as worm gear drives, which give a very compact reduction drive at the 
expense of involving significant sliding action and frictional losses. It must be emphasized however that 
kinematic quantities are unaffected by inefficiencies, so the following relationship still applies: 


Gear 


N 2 (1)1 aj 

jV, w 2 a 2 


(4.16) 


In a compound gear system, the overall efficiency of the gear train is the product 
of the efficiencies of each stage. 


Worked example 

A vehicle is driven from an electric motor via the compound gear 
system shown in Figure 4.9, where N t = 20, N 2 = 100, N 3 = 25, 

N 4 = 75. The efficiency of the first stage is 94 per cent and of the 
second stage is 96 per cent. If the final drive requires a torque of 
100 Nm at 200 rev/ min, what torque and speed are required from 
the motor? 

Adapting equation (4.13) to the present situation: 

No Na oil 

Drive ratio « = ITT X TT = ~ 

Nj N 3 co 4 

where « 4 = 200 rev/ min 


No Na 


100 „ 75 s 


••• ^ = N 4 X 4 X “4 = ^X^X 200 

= 3000 rev/min 

Overall efficiency is the product of efficiency of the tv 
fi = fit X ti 2 = 0.94 X 0.96 = 
Adapting equation (4.15) to the present situation gives: 
L a 

u = — 

1 r\n 

where L 4 = 100 Nm 

N 2 N 4 100 

and K = Nj X N3 = ^ X : 

t 100 



Figure 4.3 Compound gear s) 

3 stages so: 

0.902 or 90.2% 


so that 


' 0.902 X 15 ' 
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Inertia of a geared system 

Consider two shafts 1 and 2 (Figure 4.10) geared together with a perfectly 
efficient transmission of gear ratio n, with shaft 2 having mounted on it 
moment of inertia J 2 . Shaft 1 is driven by a motor, which causes the 
shafts to accelerate with angular accelerations oq and 
a 2 respectively. 

The motor is assumed to provide a torque 
JLj to shaft l.This torque is scaled up by 
the gear ratio n and transmitted to shaft 2 
causing inertia J 2 to accelerate with 
angular acceleration a 2 . Because the shafts 
are geared together, the motor observes 
shaft 1 as accelerating with angular 
acceleration oq. Overall, therefore, the 
motor is providing a torque Lj which is 
causing an angular acceleration a, to the 
machine being driven. As far as the motor is 
concerned, this is simply an inertia load Ji ' defined a; 



Figure 4.10 (a) An inertia driven via 
gears and (b) its inertia referred to 
the input shaft 




(4.17) 


where (in general) the notation J' will he used for the referred or apparent inertia typically 
involving an indirectly driven inertia load. In the present case the load actually consists of 
inertia J 2 driven via the gear train of ratio n, experiencing a torque Z^n. This causes J 2 to 
experience an angular acceleration of a 2 = (Lj n)/J 2 which is observed at shaft 1 as a, = a 2 n 
= (Lyn 2 )/J 2 , It can therefore he shown that the inertia J\ experienced by the motor is given by: 

J'l = ^ (4-18) 


Of particular note is the n 2 term, and the fact that inertias are scaled by the square of the 
gear ratio, rather than the gear ratio itself, as for kinematic quantities as in equation (4.9). 

By extension, a geared system with inertia J\ on shaft 1 geared to shaft 2 carrying another 
inertia J 2 (Figure 4.11) will involve the sum of J\ and the referred inertia due to J 2 , in other 
words: 

fi=Ji+^ (4-19) 
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Worked example 

A machine (Figure 4.12) consists of two 
shafts A and B, each fitted with a fh 
of inertia J A = J B = 1 kg.m 2 . Shaft 
driven from shaft A by a gear train 


observer, what does the system 
feel like if a handle is 
attached to shaft A 
(i.e. what is the inertia of the 
system referred to shaft A)? 


All the above concepts are applicable to pulley/belt and sprocket/ chain drivetrains. 


Influence of efficiency on the apparent inertia of geared systems 


It has already been shown that practical geared systems (and similar systems) are not perfectly 
efficient, and this has important implications for considering the practical effects of inertia 
in such systems. For an imperfectly efficient version of the example shown earlier, torque L x 
results in inertia J 2 experiencing a reduced torque T|L,n and hence undergoing a smaller value 
of angular acceleration (t)L, n)/J 2 . Thus the ratio of torque to angular acceleration on shaft 1 
can be shown to be: 

L, LJ 2 J 2 J'i 

~ = = ^2 = — (4.22] 

oq r\Lyn r\n z T] 

Note that the true value of referred inertia Jy is not affected by the inefficiency term - one cannot, 
for instance, store more rotational energy in a geared flywheel by connecting it via an inefficient 
gear train! However, it is useful to realize that a heavy system being accelerated via an inefficient 
transmission system will appear to have a higher value of inertia than if the transmission system were 
efficient. Conversely, it will appear to have a lower value of inertia when it is decelerating. 




= \ -| - -^2 = 1.11 kg m 2 (4.20) Figure 4.1 A geared system driven from shaft A 


Worked example 



The handle is instead attached to shaft 
B (Figure 4.13). What does the system 
feel like now (i.e. what is the 
inertia of the system referred 
to shaft B)? 

Note that since the first shaft is now 
B and the second is A, the gear ratio 
n = N 2 /N x is now 1:3 which is 
used in the calculation as 1/3: s' 


-/total _ _Jb + n 2 ~ 1 + (1/3)2 


as seen by shaft B 


= 1 + 3 2 = lOkgm 2 ( 4 . 21 ) 


(4.21) figure 4.13 A geared system driven from shaft B 
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Learning summary 

At the end of this section you should have learnt: 

/ the concept of a gear ratio; 

/ to relate angular velocities, angular accelerations and torques between input and output shafts of a 
geared system (or other system involving belts, friction drives etc.); 

/ the concept of ‘referred inertia’; 

/ the effect of inefficiency on the transmission of torque and on the apparent value of referred inertia. 


4.5 Tangentially driven loads 

There are numerous situations where rotational motion is converted to linear motion or vice 
versa, such as: 

• tangentially driven systems involving belts, chains and ropes 

• vehicles, which are also a form of tangentially driven load 

• screw-driven systems (e.g. machine tools, testing machines and presses) 

• mechanisms for converting between rotational and reciprocating motion such as cranks and 
cams. 

Within the present analysis we will concern ourselves only with systems that give a constant 
relationship between rotational and linear motion, namely tangentially driven and screw-driven 
systems. Tangentially driven systems will be covered initially, and screw-driven systems will be 
also be covered briefly, Cranks, cams etc. involve a non-uniform relationship between rotation 
and displacement, and a rigorous treatment is beyond the scope of the present discussion. 

There are numerous situations in engineering where a rotating shaft causes continuous linear 
motion. Such situations are generally referred to as tangential drives, and examples include: 

• conveyer belts 

• toothed belt linear drives, e.g. the positioning of print heads on inkjet printers 

• cranes, winches and hoists 

• vehicles. 

Indeed, any belt or chain drive involves the tangentially driven mass of the belt or chain itself. 

Tangential drives via belts, chains etc. 

Consider the simplest tangential load situation where a mass m 
is driven by a tight pulley of effective radius r via a tight belt or 
cord (Figure 4. 14). The inertia J' of this system referred to the 
axis of the pulley is: 

J' = mr 2 ( 4 . 23 ) 

Practical tangential drive situations involve pulleys or sprockets, 
together with belts, conveyors or chains, of non-negligible mass, 
and each of these contributes to the total inertia. For instance, 
consider a conveyor belt of mass m carrying a load of mass M, 
mounted on a pair of rollers, one of which is connected to the 
drive input shaft and has moment of inertia _/) and radius r-i, with 
the second being an idler of inertia J 2 and radius r 2 (Figure 4.15). 
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Figure 4.15 Conveyor belt system 


Each of these contributes to the inertia J' of the system referred to the input shaft: 

• the roller connected to the input shaft obviously contributes a moment of inertia J\, 

• the load of mass M is driven tangentially by the first roller of radius r\ so contributes Mr; 

• the conveyor belt of mass m is also driven tangentially by the first roller so contributes mr 2 ; 

• the idler roller is driven (via the conveyor belt) from the first roller with a pulley ratio of 
r 2 '.r\ so its inertia referred to the drive input shaft is J 2 (r\/r 2 ) 2 . 

Therefore, the total moment of inertia referred to the input drive shaft is: 

J' =J 1 + (M+ m)r 2 +J 2 (r l /r 2 ) 2 (4.24) 


Worked example 


e 



x 2 X 10 -6 kg m 2 

Figure 4.16 (a) Tangentially driven probe assembly showing belt and pulleys; (b) uniform 
acceleration/deceleration profile 


A probe assembly of mass 100 g on an item of laboratory equipment is to 
move 0.2 m in 1 second. It is driven by a toothed belt that weighs 30 g and has 
an effective radius of 20 mm where it passes over the pulleys (Figure 4.16(a)). 
A uniform acceleration/deceleration (triangular) velocity profile is to be used 
(Figure 4.16(b)). Each pulley has a moment of inertia of 2 X 10 _6 kg.m 2 

(a) Calculate the inertia of the system referred to the axis of the drive pulley. 

(b) Calculate the maximum acceleration of the probe and hence the 
maximum angular acceleration of the pulley. 

(c) Calculate the torque required to cause the desired acceleration. 


(a) y =2 X J pulley + (M + m)r 2 

= 2 X 2 X 10“ 6 + (100 X 10“ 


+ 30 X 10“ 3 ) X 0.02 2 = 5.6 X 10“ 5 kgm 2 
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(b) The probe covers 0.2 m in 1 s divided equally between acceleration and deceleration, 
therefore acceleration phase covers 0.1 m in 0.5 s from rest. 

Applying s = ut + \ at 2 gives 

2s 2 X 0,1 nQ 
a = ^ 2 = o 5 2 = 0.8ms 

Angular acceleration is therefore given by 

a °- 8 an a -2 

a = 7 = oo 2 = 40rads 

(c) Torque is given by 

L = J'a = 5.6 X 10“ 5 X 40 = 0.002 24 Nm 


Vehicles 

A very important type of tangentially driven 
load is a vehicle running on a road (or rail or 
other surface), and it can be useful to consider 
the effective inertia of a vehicle referred to 
(for example) the vehicle s axle, or the engine s 
output shaft. There are two contributions to 
the inertia J' referred to the axle of a vehicle 
(Figure 4.17), ignoring inefficiencies in the 
wheel/road contact, such as hysteresis in a 
rubber tyre: 



Figure 4.17 Simplified diagram of a vehicle 
showing contributions to inertia referred to axle 


mi 2 where m is the overall mass of the vehicle (including the wheels) and r is the effective 
radius of the wheels where they contact the road 


• A(/ w where N is the number of wheels and J w is the inertia of each wheel (whether the 
wheels are direcdy mounted on the axle or not — coupling between the wheels is provided 
via their contact with the road or rail) 


This gives 


J' = mr 2 + NJ W 


(4.25) 


In practice, the contribution due to the wheels is likely to be small compared with that of the 
overall mass of the vehicle. 


Screw drives 

These provide an alternative to tangential drives where 
only a small movement is desired for each 
revolution of the input shaft. They are widely 
used in machine tools for accurate positioning 
of slideways (where they are known as 
leadscrews), and within the aerospace industry 
for actuation of landing gear. 

The apparent moment of inertia of a screw-driven system (Figure 4.18), referred to the 
leadscrew axis, depends on the leadscrew pitch p, the mass M being driven and the efficiency T) 
of the leadscrew as well as the moment of inertia J s of the screw itself. The torque for a given 
angular acceleration a of the screw is therefore: 

+ (4 - 26 ' 

Unless the load is very heavy or the pitch is large, the biggest contribution to the total moment 
of inertia is often from the inertia J s of the screw itself. 



Figure 4.18 System driven by a leadscrew 


258 


Electromechanical drive systems 


Learning summary 

By the end of this section you should have learnt: 

/ to calculate the referred inertia of a tangentially driven system 
/ the inertia behaviour of screw-driven systems. 


4.6 Steady-state characteristics of loads 

So far, the analysis has only considered the effects of inertia. With this theory alone, it could 
be assumed that, once a machine has been accelerated to its running speed, it will carry 
on running for ever with no further input of energy. Clearly this is not true, and it is now 
necessary to consider the various contributions to frictional losses that tend to oppose 
movement when the machine is running. 


Frictional effects and losses 

Three types of frictional losses are typically encountered within machines: 


Coulomb friction 

This is based on Coulomb’s well-known law for contacting bodies (Figure 4.19) 
that are able to move linearly with respect to each other, stating that the limiting 
frictional force F for two bodies experiencing contact force N is given by 

F = pJV (4.27) 

In fact, two distinct regimes can be considered: 



Figure 4.19 Block in contact with 
a surface 


• static friction (‘stiction’) , the force required to get two bodies moving with respect to each 
other; in effect this is the force required to break any intermolecular or interatomic bonds 
that have built up between the bodies over time; 


• sliding friction or dynamic friction, a frictional force between two objects that are already 
sliding over each other; this remains (in theory) constant irrespective of the relative speed of 
the two objects. 


In practice, the coefficients of static friction, (jt s , and dynamic friction, p, D , tend to be noticeably 
different (it takes a slightly smaller force to keep two components moving with respect to each 
other, compared to the force required to get them to move). Typical values might be |x s =1.5 
|X| > However, as stated above, (Jt D may be assumed to be independent of relative speed, so the 
force-speed relationship may be represented graphically as shown in Figure 4.20: 


Ft 

|x D iV 


Sliding friction force is assumed 
to be independent of speed 





Figure 4.20 ‘Stiction’ and dynamic friction for linear 
movement 
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Mathematically, this can be represented for the static situation as: 



— F s =£ F =£ F s for v = 0 

(4.28) 

where 

F s = p s N 

(4.29) 

and for the dynamic situation: 

F = F d for v > 0 

(4.30) 

and 

F = (-Fd) for v < 0 

(4.31) 

where 

Fd = 

(4.32) 

While components undergoing linear sliding certainly are encountered in engineering (for 
example machine tool saddles moving with respect to the stationary bed, pistons sliding 
in cylinders), the above concepts are equally applicable to rotational movement, and the 
characteristics shown earlier may be represented graphically in Figure 4.21 and 
mathematically as: 


— Ls =£ L =£ Ls for w = 0 

(4.33) 


L = L d for o) > 0 

(4.34) 

and 

L = (— L d ) for a) < 0 

(4.35) 



Sliding (dynamic) friction torque is 
assumed to be independent of speed 


Figure 4.21 ‘Stiction’ and dynamic friction for rotational 
movement 


Viscous friction 


Viscous friction is caused by the shearing of a Newtonian fluid, e.g. a lubricating oil that 
separates the two bodies that are ‘sliding’. Newtons model of viscous friction states that the 
shear stress on a fluid is proportional to the shear strain rate of the fluid. For a linear bearing 
properly lubricated with a Newtonian fluid, this implies that the viscous friction force is 
proportional to the relative velocity of the two components. For linear motion of two plates of 
area A separated by distance d the force F required to maintain a relative velocity v (sometimes 
known as the ‘drag force’) is: 


F 


Av\l 

~d~ 


(4.36) 


where here p is used to represent the viscosity (strictly the dynamic viscosity) of the fluid. 
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This gives a simple linear relationship between force and velocity which can represented as: 

F = a\V ( 4 . 37 ) 

This can be shown graphically (Figure 4.22) as: 



Again, it is useful to represent this in rotational form (Figure 4.23): 

Locu ( 4 . 38 ) 


L = b t <o ( 4 . 39 ) 



movement 


Windage 

Windage is a force caused by aerodynamic effects such as turbulence, e.g. drag on a car, 
windage drag on a motor due to its cooling fan and to the ‘churning up’ of air in its casing. The 
magnitude of windage force is proportional to the square of velocity, noting that the direction 
of the force opposes the direction of movement. 

| F | xv 2 ( 4 . 40 ) 

This is represented graphically in Figure 4.24. 

For a body of frontal area A moving through fluid of density p with velocity v the windage 
force it experiences is: 

I F I — jpv 2 AC d ( 4 . 41 ) 
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where Cj is an approximately constant value called the coefficient of drag, which depends 
upon the shape of the body, and is in the order of 1 or slightly larger for ‘bluff’ bodies (such as 
cubes, plates etc.) going down to much lower values for aerodynamically designed, streamlined 
shapes such as aircraft. Such coefficients are widely tabulated in the fluid mechanics literature 
and are presented in Unit 1 of the present book. In simplified form this kind of relationship 
can be represented as 

\F\ = a 2 v 2 (4.42) 

In a similar manner for rotation, torque is proportional to the square of angular velocity: 

\L\ oc a) 2 (4.43) 

\L\=b 2 h> 2 (4.44) 

This can be represented graphically in Figure 4.25. 



Summary of frictional effects 

For linear motion, assuming motion in a positive direction: 

• at rest/ threshold of movement: F = F s = static friction force 

• at velocity v: F = F n + a^v + a 2 v 2 

i.e. total frictional force = Coulomb friction force + viscous friction force + windage. 

For rotation in a positive direction: 

• at rest/threshold of movement: L = L$ = static friction torque 

• at angular velocity (o:L = L D + fc 1 (o + b 2 co 2 

i.e. total frictional torque = Coulomb friction torque + viscous friction torque + windage. 
Note that for a tangentially- driven system the force-speed relationship can be transformed to a 
rotational characteristic by substituting T = Fr and v = wr, giving (for example): 

L = F D r + a\ r 2 (ti + a 2 r 3 u> 2 
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Characteristics of fans, blowers, pumps etc. 


Not surprisingly, windage-type effects are the predominant contribution to the torque-speed 
characteristics of rotodynamic fans and blowers. Broadly similar characteristics are also 
encountered for similar devices such as centrifugal pumps, such as those encountered in many 
chemical plants. It should be noted that, in approximate terms, the pressure that can be supplied 
by rotodynamic pumps, blowers, fans etc. is also proportional to the square of the speed. 
However, an important issue is that if the torque required to drive such a device is 
proportional to the square of the speed, then the power (which is given by torque X angular 
velocity) will increase with the cube of the speed. This has very important implications for 
energy efficiency, in that considerable savings can be made by reducing the speed of 
such devices to the minimum which will deliver the desired performance, rather than 
throttling their flow rates or allowing them to supply higher pressures and flow rates than 
are required. 

Note that the issues surrounding the torque-speed characteristics of positive-displacement 
pumps, compressors etc. are rather different. The torque required to drive them is dominated 
by the pressure or head against which they are pumping: if this is constant (for example, 
if supplying a header tank), the torque required (when averaged over a cycle) will not 
change greatly with speed; if the pressure changes with flow rate, then the torque will also 
change. 


Other contributions to steady-state running characteristics 

While it is clearly impossible to list all possible contributions to running characteristics, some 
common examples include gravitational load, for example due to items being carried upwards 
on a sloping conveyor, winched upward using a hoist, or where a vehicle is climbing a slope. 
Such a contribution to load torque will be independent of speed, and can be found using 
simple engineering mechanics (refer to Unit 6 of An Introduction to Mechanical Engineering: 

Part 1). 

Example: Torque on a high-speed rotating antenna 


0 1 2 3 4 

Rotational speed (rev/second) 

Figure 4.26 Contributions to torque-speed characteristics of a 
high-speed rotating antenna (Adapted from Blackburn et al., 1960) 


Figure 4.26 illustrates the characteristics of a system that exhibits all of the above contributions. 
It is interesting to note that the windage element varies with the ambient wind speed. 



-Windage in 70-knot 
(=100 km/h) gale 


- Total torque 


-Windage in still air 
-Coulomb friction 


-Viscous friction 


263 


An Introduction to Mechanical Engineering: Part 2 

Some positive aspects of friction - it wastes energy, but is it always 
a bad thing? 

Coulomb friction, viscous friction and windage all involve conversion of mechanical power 
into heat. Within mechanical power transmission situations this is normally undesirable as 
it involves wasted energy, yet friction is usefully harnessed as an essential feature of several 
engineering devices: 

• Brakes, which apply a frictional torque to oppose motion, are widely used to convert 
unwanted kinetic energy (for instance, in a vehicle) or potential energy (in the load of a 
crane or elevator) into heat, which clearly needs to be dissipated. 

• Clutches are used to provide a means of temporarily matching differing speeds while 
transmitting a given value of torque. This is achieved by frictional coupling of the input shaft 
to the output shaft via two contacting plates. The function of a clutch will be examined 
further in Section 4.13. 

• Worm gears (Figure 4.27) are widely used to provide a very large speed reduction ratio (and 
torque increase) between a rapidly rotating input shaft and a much more slowly rotating 
output shaft. This is achieved in much less space than would be possible via conventional 
(spur or helical) gears, which would require the use of compound gearing to achieve the 
required gear ratio. Unlike conventional gears, which involve primarily rolling friction 
between gear teeth, worm gears involve a primarily sliding action as the screw-like worm 
rotates against the flanks of the teeth of the worm wheel, driving it around. This friction 
results in the undesirable feature of very low efficiency (values of 30 per cent are typical for 
worm gear assemblies). However, friction also accounts for an important feature of most 
worm gear systems which contrasts with conventional gear systems: provided the helix angle 
of the worm is sufficiently shallow (typically corresponding to the use of as single-start or 
possibly two-start worm) they can normally only transmit drive in one direction through 
the power train, with the drive going only from the input shaft to the output shaft. In other 
words they are self-locking if an attempt is made to provide the drive from the output shaft 
to the input shaft. This makes them a particularly useful feature of devices such as hoists, 
cranes, torsional testing machines etc. For example, if the drive to a worm-driven hoist fails, 
the load will not descend as the worm gear assembly will lock up. Similarly, a worm-driven 

torsional testing machine will retain the torsional loading 
on the specimen if no input torque is applied. In both 
of these cases it is said that the drive system cannot be 
‘over- hauled’. There are exceptions to this: in some 
cases a multistart worm is used to provide a compact 
speed increase gear system. These are relatively rare in 
engineering — perhaps the classic example of such a 
system was within the speed-limiting governor used 
in mechanical telephone dials up to the 1970s, where 
a rapidly rotating centrifugal brake was operated from 
the slowly turning dial via a reverse-driven worm gear 
mechanism. 
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Learning summary 

By the end of this section you should have learnt: 

/ the different contributions to steady-state running characteristics of loads; 

/ to express these different contributions to the load characteristics in the form of a mathematical 
expression, both for linear and rotational motion; 

/ that friction can have a beneficial role in some situations as well as having a detrimental effect on 
efficiency in other situations. 


4.7 Modifying steady-state characteristics of 
a load using a transmission 


In order to match the torque-speed characteristics of a load to the characteristics of the 
mechanical power source that will drive it, it is often necessary to drive the load via a 
transmission of some kind, which may take the form of gears, belts, chain drive etc. as described 
in Section 4.4. It is therefore very useful to find the characteristics of the load referred to the 
input shaft of the transmission. This is straightforwardly achieved by substituting the relations 
for torque and speed given in equation (4.9) (or, if inefficiencies are taken into account, the 
torque relations given in equation (4.15)) into the equation giving the load’s torque-speed 
relationship. For example, consider a load with the torque— speed (or, strictly, torque— angular 
velocity) characteristic as follows: 


Figure 4.28 (a) Typical torque-speed curve for a directly driven machine 
involving dynamic Coulomb friction, viscous friction and windage (b) family of 
torque-speed curves 

Such a characteristic is shown diagrammatically in Figure 4.28(a). The load is driven via a 
transmission with a ratio of n: 1 and an efficiency of T), and it is desired to find the characteristic 
of the system expressed as the relationship between torque L' and speed co' at the input shaft of 
the transmission. Making use of equation (4.9), « is expressed in terms of of: 


L = L D + b t to + 


(4.45) 




(4.46) 


Similarly, making use of equation (4.15), L' is expressed in terms of L as: 



(4.47) 
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Inserting these into 


equation (4.45) gives: 

t! = — (l d + t 


~ b 2 i o 2 ) 


= — + 


(to') 2 


(4.48) 


It is instructive to observe that a family of torque— speed curves (Figure 4.28(b)) can be 
constructed as the ratio n is varied: a low value of n will cause the load to require high values 
of torque to be provided at a relatively low range of speeds, while a large value of n will require 
lower values of torque to be provided, but with the drive being at higher speeds. 

It is crucial to note that changing the transmission ratio does not alter the power requirements 
of the load - it merely allows the characteristics of the load to be moved to provide a better 
match to those of the mechanical power source. The issue of matching of load and power 
source will be explored further in Section 4.13. 


Learning summary 

By the end of this section you should have learnt: 

/ to refer the torque-speed characteristics of a load to the input shaft of a transmission system in 
order to obtain the characteristics observed by the mechanical power source driving it; 

/ that a transmission will affect the combination of torque and speed required to drive a load but will 
not help to overcome a shortfall in the power available for providing the drive. 


4.8 Sources of mechanical power and their 
characteristics 

In some cases, a machine will he driven manually via a handle, pedals etc., hut in most cases 
some other source of mechanical power will be needed. There are various categories of such 
systems, and it can be useful to distinguish them. Exact terminology can vary (see Chambers 
(2007) Science and Technology Dictionary for one set of formal definitions) but as a guideline: 

1. The term ‘motor’ is usually used to describe a machine that causes motion or generates 
mechanical power, often drawing its energy from some other easily managed form 
(electrical, hydraulic, pneumatic, possibly chemical). Within the present unit, we will consider 
a variety of electric, pneumatic and hydraulic motors. However, the term ‘motor’ historically 
also covered internal combustion engines, leading to the formal term ‘motor vehicle’ and 
related terms such as ‘motorist’ etc. 

2. Prime movers are devices that convert a natural source of energy into mechanical power; 
these can include engines which make use of burning fuel, but could also include wind 
turbines, water turbines etc. 

3. The term ‘engine’ tends to be reserved for a machine that creates mechanical power 
from heat energy; examples are internal combustion engines, jet engines (a form of gas 
turbine), steam engines etc. Within the present discussion we will pay some attention to the 
characteristics of internal combustion engines, though reference will be made to the very 
different characteristics of other forms of engines. 

In order to be able to design a drive system, it is important to understand how the torque 
available from a source of mechanical power varies with speed. With this information it is then 
possible to analyse the way in which the mechanical power source interacts with the load it is 
driving. 
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Characteristics of the internal combustion engine 

Although there is no immediate derivation of its torque-speed characteristics, the internal 
combustion engine is one of the most important prime movers. Its torque-speed characteristics 
vary depending upon the position of the throttle (on a petrol engine) or the setting of the 
fuel injection pumps (diesel engine). It is usual to give only the maximum torque-speed 
characteristics (for a petrol engine this is known as the wide open throttle (WOT) condition); it 
is not usual to give the torque-speed characteristics for different settings of the throttle etc., but 
for illustrative purposes, possible curves of this type are included in Figure 4.29. (Note that it 
is also usual to present internal combustion engine 
characteristics such as those in Figure 4.29 in terms of a 
quantity known as ‘brake mean effective pressure’, a quantity 
related to torque and to the engine capacity. For simplicity, 
however, graphs of torque and speed are plotted directly here.) 


A particular feature of internal combustion engines is that 
they cannot run in a sustained manner at low speeds: instead 
they stall, a phenomenon very familiar to learner drivers! 
Rather than represent the infinite set of characteristics, it 
is usual to represent instead the maximum possible torque 
(under wide open throttle conditions etc.) for each speed, 
and to fill the area under the curve with a ‘map’ showing 
the specific fuel consumption (SFC) of the engine for each 
combination of torque and speed (Figure 4.30 a and b).This 
gives the amount of fuel consumed by the engine per unit of 
mechanical energy produced (usually expressed in g/kWh) It 
may be noted that there is an optimal combination of torque 
and speed (close to the maximum torque value, but well 
below the maximum safe running speed) at which the engine 
runs at maximum efficiency, consuming the least amount of 
fuel for a given output. 


Torque-speed characteristic under 
wide open throttie (WOT) condition 


o 



Efficiency decreases away from 
optimum operating point 


Figure 4.30 (a) Features of a torque-speed-SFC map 
for an internal combustion engine; (b) torque-speed- 
SFC map for a 1.9-litre internal combustion engine 
(adapted from Shayler et al., 1999) 


Engine will not run in a 
stable manner below this 

speed (will stall) Torque-speed characteristic under 

\ wide open throtde (WOT) condition 

.Damage may occur 
above this speed 


Possible torque- 
speed characteristics 
for low throtde settings 


Figure 4.29 Hypothetical set of torque-speed 
characteristics for an internal combustion engine at 
different throttle settings 




Engine speed (rev/min) 


The great advantage of the internal combustion engine is its ready availability and 
independence from other sources of power; its most obvious applications are in motor 
vehicles, ships and railway locomotives. In addition, diesel engines in particular are often used 
for powering generators for medium-sized communities and for standby pumping or power 
generation applications. However, rather than using a prime mover such as an engine, it is often 
far more appropriate to use some kind of motor which is supplied with power in the form of 
electricity or pressurized fluid and delivers mechanical power locally in a controllable way. 
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Learning summary 

By the end of this section you should have learnt: 

/ the different types of mechanical power sources used within drive systems; 

/ the meaning of a torque-speed-SFC diagram for an internal combustion engine, and in particular 
understand the implications of its main features. 


4.9 Direct current motors and their 
characteristics 

Electric motors provide the drive for many mechanical systems, and generally provide the most 
convenient source of power. Each type of motor has a different torque— speed characteristic 
with some (e.g. dc shunt or permanent-magnet motor, or ac induction motor) providing 
good running characteristics, and others (e.g. dc series motors) providing much better starting 
characteristics. For each application the engineer must choose a machine that will provide 
the desired torque over the full speed range of the mechanical system. This section examines 
direct current (dc) motors, which are divided into five categories: shunt, series, compound, 
separately excited and permanent magnet motors. Shunt motors run at relatively constant speed 
and are ideal for use with conveyors, pumps, compressors and rolling mills. Series motors are 
used for traction, lifts and other applications where a large starting torque is required, and can 
also be used with ac supplies (when they are known as ‘universal’ motors). Compound motors 
combine the characteristics of shunt and series motors, while separately excited motors provide 
accurate speed control. Permanent magnet motors have similar characteristics to shunt wound 
motors but make use of permanent magnets to provide the stationary magnetic field, rather 
than generating the field electromagnetically. 

In order to understand how dc motors work, it is necessary to have an understanding of 
electromagnetism and to be familiar with the basic equations associated with such systems. These 
prerequisites are covered in Part 1 , Unit 5 and the reader is encouraged to refer to that material. 

Because this section (along with Sections 4.10 and 4.11) overlaps the disciplines of engineering 
mechanics and electrical systems, there is a conflict of notation in that mechanical engineers 
use the symbol L to represent torque, while electrical engineers use L to represent inductance. 
In order to avoid confusion within the present section and Sections 4.10 and 4.11, the word 
‘torque’ will generally be used explicitly in preference to the use of a symbol, though the 
symbol T will occasionally be used for torque within a formula. Also, the symbol n will be used 
to represent speed in rev/ min. 


Construction of dc motors 

The basic dc motor has two parts; a stationary field and a rotating armature. The field generated 
either by electromagnets constructed of steel laminations, with salient poles, around which coils 
are wound or by permanent magnets, which replace the poles and associated coils. Figure 4.31(a) 
shows a four-pole motor with two N-poles and two S-poles (indicating north-seeking and south- 
seeking poles respectively). Direct currents flow in the coils creating a static magnetic field which 
links with the armature winding. The armature has circular steel laminations fixed to a central 
steel shaft. Slots are cut in the outer circumference as shown in Figure 4.31(b). Also fixed to the 
shaft, adjacent to the steel core is a commutator constructed of copper segments that are separated 
by thin sheets of insulation. Coils are laid in the slots in the steel laminations and their ends brazed 
to individual commutator segments. Carbon brushes connect the commutator to an external 
electric supply. Figure 4.31(c) shows a partially completed motor. 
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Figure 4.31 Construction of a dc motor: 
(a) cross-section through a four-pole 
motor; (b) simplified diagram of a single 
armature winding; (c) partially completed 
motor showing relationship between 
armature and stator poles 


Operation of dc motors 

The simplest dc motor has two poles, an armature with 
two conductors, 1 and 2, forming a single coil and a 
commutator with two segments, as shown in 
Figure 4.32. 

An external direct voltage source supplies a 
current, Ij to the field winding. This establishes 
a magnetic flux, 4> between the two poles. 

The flux passes through both air and steel. 

Its magnitude is a function of the field External 

current, Ip supply 

()) = Fn (I f ) (4.49) 

Because the B-H (flux density vs magnetic 
field strength) characteristic for steel is 
non-linear the relationship between 4> and 
Ij is also non-linear. Another direct voltage 
supply feeds current, I a to the armature 
winding via the carbon brushes. The armature current produces subsidiary magnetic fields 
that circulate around the armature conductors. Figure 4.33(a) shows a cross section of the 
conductors encircled by the subsidiary fields and the main field passing from the N-pole to 
S-pole.The subsidiary fields interact with the main field to produce the resultant field shown 
in Figure 4.33(b). This produces forces on the conductors, which create a torque on the 
armature making it rotate. 



Figure 4.32 Simplest form of dc motor showing armature and stator 
windings 


By supplying current to the armature via carbon brushes and commutator the currents flowing 
through the individual armature conductors reverse every time the coil is perpendicular to the 
main field, maintaining the torque in the same direction. Figure 4.34(a) to (e) show the change 
in current direction as the armature rotates. In Figure 4.34(a) and (b) the current in conductor 
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Figure 4.33 Generation of forces on a pair of 
conductors carrying current into the page ((£>) 
and out of the page (Q) 


1 flows ‘into the page’ and the current in conductor 2 flows ‘out of the page’ making the coil 
rotate anticlockwise. When the coil passes the horizontal (Figure 4.34(c)) the currents in the 
two conductors are reversed by the switching action of the commutator to maintain the torque 
and hence the rotation in the same direction. 


w w w 


0 rV1 F o rr1 0 hr1 

Figure 4.34 Forces on armature windings at different armature orientations 



Emf induced in the armature winding 

As the armature coil rotates, an emf is induced in it which tends to oppose the externally 
supplied armature current, I a . The external supply voltage must overcome this emf if the 
machine is to motor and deliver mechanical power through the shaft. 

Figure 4.35 shows the variation in flux linkage between the main field and the coil as the latter 
rotates. In Figure 4.35(a), when the coil is perpendicular to the field, there is maximum flux 
linkage, i|i m . With only one coil this equals the flux between the poles, 4>. 



Figure 4.35 Variation in flux linkage between the main field and armature coil 
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When the coil is parallel to the field (Figure 4.35(b)) the flux linkage is zero. As the coil 
continues to rotate, the direction of flux linkage is reversed. In Figure 4.35(c) the coil is at an 
angle 0 to the field. At this point 

flux linkage, i|/ = i|i m sin 0 

Taking the time when the coil is parallel to the field (Figure 4.35(b)) to be zero and assuming 
that the coil rotates at to rad/s, the time, t when the coil is at angle 0: 

t = 0/w 


and the flux linkages, i|r at that time are: 

i|i = i|; m sin ®f 

The emf induced in the coil is, according to Faradays law (see Part 1, Unit 5), equal to the rate 
of change of flux linkages. 

d4> 

e = ~dt 

so, the emf induced in the coil at time, t = 0/o) is: 

e = »|/ m « cos g ot 

The coil continues to rotate to (tt/ 2) , Figure 4.35(d), arriving at time, t = tt/ (2w). 

If s = average emf induced in the coil as it rotates from parallel to the field (Figure 4.35(b)) to 
perpendicular to the field (Figure 4.35(d)) then: 

£= mM ^ tocos cot dt 

[2®] 

s=^Jsmcotr“ 

2 co . 

s = — tu m 

TT 

but \\i m = <j> 


Many machines have several pairs of poles. With two N- and two S-poles the motor shown 
in Figure 4.31 has two pairs of poles. For a given speed of armature rotation, additional poles 
reduce the time the coil links with flux from a particular pole. This affects the induced emf. For 
a machine with p pairs of poles the coil only links with the magnetic flux from each pole for 


(pci)) secon ds. If the flux per pole is 4> then the average emf induced in the coil, s is given by: 

In practice an armature winding will have many coils connected in series and the total emf 
induced in the armature winding is given by: 


total emf induced = average emf induced X number of coils 

in armature winding in one coil in series 

E = 2(0 X A s 

TT 

Both the number of poles (2 p) and the number of armature coils in series (A s ) are constant for 
a particular machine, so it is convenient to combine these two parameters into a single design 
constant, k, where 


k = 


2pA s 

TT 
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and express the total emf induced in the armature winding, E as: 

E = fe4»co [V] (4.50) 


Torque 


The electrical power delivered to the armature is converted to mechanical power, creating a 
torque, which makes the armature rotate. 

Assuming that there are no losses in the system: 

electrical power delivered to armature = mechanical power at the shaft 

Pa = Pm (4-51) 

electrical power = armature emf X armature current 

P c = EX I a (4.52) 

mechanical power = torque X angular velocity 

P m =TX oo (4.53) 

Combining equations (4.51) and (4.53) 

R 

T = — (4.54) 

oo 

Torque = Electrical power delivered to the armature/Angular velocity 
Substituting (4.50) and (4.52) into (4.54): 


Torque = 


giving: 


Torque = [Nm] 


(4.55) 


In practice there will be some losses in the system due to friction at the bearings, windage 
and iron losses in the steel core. These reduce the mechanical torque produced at the shaft. 
However, as these losses are usually small, they will be ignored for the rest of this section, and it 
will be assumed that the torque calculated using equation (4.55) equals the mechanical torque 
delivered at the shaft. 


Armature equivalent circuit 

To analyse the operation of dc motors it is convenient to represent the armature winding by 
the equivalent circuit shown in Figure 4.36. E is the emf induced in the armature and R a is 
the armature winding resistance, which is usually quite small. The armature terminal voltage is 
given by: 

V a = E + I a R a (4.56) 


Voltage 

applied 


armature current, I. 


armature 
resistance, R_, 

^ | Induced emf, E 
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Figure 4.36 Equivalent circuit for the armature 
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Types of dc motor 

There are five types of dc motor: 

(1) shunt motor 

(2) series motor (also used with ac supplies as ‘universal motor’) 

(3) compound motor 

(4) separately excited motor 

(5) permanent magnet motor. 

These are shown in Figure 4. 37. The shunt motor (a) has the field and armature windings 
connected in parallel with the supply voltage. In the series motor (b) the two windings are 
connected in series. Compound motors, which have two field windings, are divided into 
two subcategories: the long-shunt motor (c) and the short-shunt motor (d). In the long- 
shunt motor the parallel field winding, _R fp is in parallel with the series field winding, R fs and 
armature in series, while in the short-shunt motor, Rg is in series with the parallel combination 
of R {p and armature. The separately excited motor (e) has independent voltage supplies to the 
field and armature windings. The permanent magnet motor uses permanent magnets rather 
than field windings to provide the stationary magnetic field. 



Series field winding 
resistance, R fs 


Long-shunt compound motor 



Short-shunt compound 


G 

Voltage applied 
to field winding, 

V f 




Separately-excited motor 


Figure 4.37 Different configurations of dc motor: (a) shunt motor; (b) series motor; (c) long- 
shunt compound motor; (d) short-shunt compound motor; (e) separately excited motor 


The direction of rotation of any dc motor is reversed by swapping the connections to either 
the armature or field windings, but not both. In the case of a permanent magnet motor, only 
the armature connections exist; swapping the connections to this will reverse the motor. 
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DC shunt motor 

As stated above, the dc shunt motor (Figure 4.37(a)) has the armature and field windings 
connected in parallel with the supply voltage. To withstand the applied voltage, the field 
winding is constructed of a large number of turns of thin wire with a resistance of hundreds of 
ohms. The armature winding has a large cross sectional area and a small resistance, usually less 
than lft. 

From equation (4.50) the emf induced in the armature winding, E is given by: 

E = fecjia) 

but from equation (4.49) 4* is a function of the field current, If 
Hence 

f = *4> = Fn(J f ) 

fe4> versus If is normally presented as a graph, called the motor field characteristic. This graph is 
used to find the motor torque versus speed characteristic. The worked example demonstrates 
how. 


Worked example 




Figure 4.38 Characteristics of a dc motor: (a) motor field characteristic; (b) torque 
versus armature current; (c) angular velocity versus armature current; (d) torque 
versus angular velocity 

A 220 V dc shunt motor has an armature resistance of 0.8 Q and field winding 
resistance of 220 Q. The motor field characteristic is shown in Figure 4.38(a). 
Calculate the field current. If and the corresponding value for k§. 

The motor drives a load torque of 10 Nm. Calculate the armature current, I a , 
and speed, n. 
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The motor field characteristic is used to find the value of fe(J>- 
When Ij = 1A, k|> = 0.84 Vs/rad. 

With the field winding connected directly across the supply voltage, the field current, and 
hence the value of k§, is constant for all loads. 

Using equation (4.55) 

Torque = A’4> 


when T = 10 Nm, 


Using equation (4.50) 

E = /e4>0) 

and substituting into equation (4.56) 

V = fe4>w + / a R a (4.57) 

220 = 0.84m + 11.8 X 0.8 
220 = 0.84m + 9.4 

220 - 9.4 = 210,6 
“ 0.84 0.84 

m = 250.7 rad/s 

Although, we do the calculations in radians per second it is usual to express motor speed 
in revolutions per minute, n, where: 




n = x — X 60 = -v — rev/min 

Ztt Ztt 

In this case, n = 2394 rev/ min. 

By repeating these calculations using other values of torque, the following characteristics 
can be drawn: 

• torque versus armature current - Figure 4.38(b) 

• angular velocity versus armature current — Figure 4.38(c) 

• torque versus angular velocity - Figure 4.38(d). 

Figure 4.38(d) shows that, as the torque is increased, there is only a small drop in motor 
speed, making the dc shunt motor ideal for driving machine tools, pumps and compressors. 


Torque-speed characteristics of dc shunt motors 

From the viewpoint of the design of an electromechanical system, it is useful to express the 
torque-speed relationship explicitly: 

V = fe(j)M + I ^ 

R a X torque 
= HU> + 7T 
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This can be rearranged as follows: 

k<$> V fe 2 4» 2 a> 

= R + tor q ue 

and hence the torque can be expressed explicitly in terms of speed: 

k<$> V fe 2 4> 2 a ) 

Torque — ~ ^ 

Alternatively, speed may be expressed as a function of torque: 

V R 3 R torque 
w fc4> k 2 4> 2 


(4.58) 


(4.59) 


The constant k§ (for a given value of field flux c|>) is sometimes clustered together as the 
motor constant K m giving the emf E induced in the armature, or back-emf, in terms of the 
angular velocity «: 

E=K m o) 

and the torque in terms of the current J a : 

Torque = K m l x 

This notation will be used within the unit on control theory. 


Speed control of dc shunt motors 

To find a way to control the speed of a dc shunt motor we should examine equation (4.57), 
which can be rearranged to give: 

V - I a R a 
“ “ H 

The voltage drop across the armature winding (/.,R- a ) is usually small compared to the supply 
voltage, V. But, if the latter is fixed then the motor speed is dependent on the flux. However, 
the flux is dependent on the field current. So, with the supply voltage fixed, an increase in 
field circuit resistance reduces the field current, thereby reducing the flux and increasing the 
speed. 

This is demonstrated in the following examples. The same motor is used to drive the same 
load torque. In the first example, the motor is operated without additional resistance. In the 


Worked example 

A 240 V dc shunt motor has a field 
winding resistance, Rj- of 160 Q, armature 
winding resistance, R a of 0.2 Q and a 
motor field characteristic like that 
shown in Figure 4.39. The load torque 
is 50 Nm. 

Calculate the field current, armature 
current, supply current and motor 
speed. 
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' Rt 60 

From motor field characteristic, Figure 4.39, when I c = 1.5A, k<$> = 1.91 Vs/rad 
Using equation (4.55) 

Torque, T = k<$> I 3 


Supply current, 

From equation (4.56) 
From equation (4.50) 


a 1.91 
= 26.2A 

is = i(+ 4 

= 1.5 + 26.2 = 27.7 A 

&=v-h R* 

= 240 - 26.2 X 0.2 = 234.8V 

E = 

234 8 

= 122.9 rad/s 


o=E = 


ktf> 

_ 30w 


1.91 

= 1174 rev/min 


Worked example 




O O 

Figure 4.40 (a) dc motor with shunt field winding regulator; (b) graph of 
angular velocity versus regulator resistance 


The dc shunt motor in the previous example has a shunt field winding 
regulator connected in series with the field winding, as shown in Figure 
4.40(a). The regulator resistance, R r is set at 40 Q. The load torque is 
maintained at 50 Nm. Calculate the new field current, armature current, 
supply current and speed. 
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Field current, If — 


240 

' 160 + 40 " 


1.2 A 


R f +R r ' 

From motor field characteristic, Figure 4.39, when If = 1.2A, kfy = 1.83Vs/rad 
torque 

Armature current, / a = — ^ — 

4 « Tlk - 27.3A 


Supply current, I s = If + / a 


From equation (4.56) 
_ 


= 1.2 + 27.3 = 28.5A 
E = V- 4 Ra 

= 240-27.3 X 0.2 = 234.5 V 


E — fecjjco 

E 234.5 . , , 

“ = H = T83" = 128 ' lrad/s 


n = Alhl = 1223 rev/min 

TT 

j The connection of a shunt field regulator in series with the field winding has increased 
Hi the resistance in the field winding circuit and with it the motor speed. 

| By repeating these calculations using various values for the regulator resistance, the graph 

i of angular velocity, oo, versus regulator resistance, R, can be plotted as shown in Figure 
4.40(b). 

The graph shows that the motor speed rises as the regulator resistance is increased. 


second, additional resistance is connected in series with the motor field winding. 

DC series motor 

A dc series motor is illustrated in Figure 4.37(b). It has the field and armature windings 
connected in series, so that the same current, I flows through both windings, which have large 
cross sectional areas and low resistance. 


Worked example 

A 230 V dc series motor has the flux versus field current characteristic shown 
in Figure 4.41(a). The motor design factor, k is 142, the armature winding 
resistance, R a is 0.12 Cl and the field winding resistance, Rf is 0.03 ft. 


Calculate the speed and torque when the current is 25 A. 

From the equivalent circuit, Figure 4.37(b) 

E= V-I^ + Rd 

E = 230 - I X (0.12 + 0.03) = 230 - 0.15 I 
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When I = 25 A 


E = 230 - 25 X 0.15 - 230 - 3.75 = 226.25 V 


1(A) 

o 



100 150 

n (rads -1 ) 


0 50 100 

1(A) 

o © 

Figure 4.4 (a) Flux versus field current characteristic; (b) torque versus 
current; (c) angular velocity versus current; (d) torque versus angular velocity 


From the motor field characteristic, Figure 4.41(a) when I = 25 A, cf> = 10 mWb. 
Using equation (4.50) 


' 142 X 10 X 10“ : 


From equation (4.55) 


When I = 25 A 


Torque = fee (>/ 


2tt 


By repeating the calculations with different values of current the following graphs can be 
plotted: 

• torque versus current — Figure 4.41(b) 

• angular velocity versus current - Figure 4.41(c) 

• torque versus angular velocity - Figure 4.41(d). 
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The following example shows how we can find the torque versus speed graph for this type of 
motor. 

If it is assumed that 4> is proportional to I, and if the effects of resistance are neglected, then to 
a first approximation:, it can be shown that 

Torque oc 
Torque oc 

At low speeds dc series motors create a large torque, which reduces as the speed increases, 
making this type of motor the ideal choice for lift and traction applications. Series motors 
will run at very high speeds when no load is coupled to the shaft. To prevent damaging the 
machines they must always be operated with permanently coupled loads. 

The speed of a series motor may be controlled by a variable resistor connected in series with 
the field winding. This is demonstrated in the following worked examples. The same motor is 
used to drive a constant load torque. In the first example, the motor is operated without any 
additional resistance. In the second example, a resistance is connected in series with the motor 
windings. 


Worked example 

The 240 V dc series motor shown in Figure 4.42(a) has an armature resistance 
of 0.1 Q and a field winding resistance of 0.05 Q. The motor field characteristic 
is shown in Figure 4.42(b). The motor drives a load torque of 250 Nm. 
Calculate the speed. 









The first step to solving this problem is to draw the graph of torque versus current,. 

This requires selecting a number of currents on the horizontal axis of Figure 4.42(b) and 
for each point reading off the corresponding value of fccj) and calculating torque using 
equation (4.55): 
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Torque = fee \> I 

For example when I = 50A 

4> = 3.12Vs/rad 

and 

Torque = 3.12 X 50 = 156 Nm 

These calculations are repeated for several currents and the graph of torque, T versus 
current, I (Figure 4.42(c)) is plotted. 

From this graph: 
when torque = 250 Nm 

I = 78A 

E = V-I{R i + Rf) 

= 240 - 78 X (0.1 + 0.05) = 228.3V 
From Figure 4.42(b) when I = 78A 

fecf> = 3. 25 Vs/rad 

Using equation (4.50) 

E 228.3 ,. 

w = h = w = 70 - 3rad/s 

n = ~ = 671 rev/min 
2tt 


Worked example 

The dc series motor in the previous example has a resistance, R s of 0.05 Q 
connected in series with the armature and field windings, as shown in Figure 
4.43(a). The torque remains at 250 Nm. Calculate the new speed. 



*.( A) 

o o 

Figure 4.43 (a) dc series motor with additional series resistance; 
(b) graph of motor speed vs. resistance for load of 250 Nm 
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From the torque, T, versus armature current, I, characteristic for the motor, Figure 4.42(c) 
when Torque = 250 Nm, I = 78 A 

E = V- I (R a + R{+ IQ 
= 240-78 X (0.1 + 0.05 + 0.05) 

= 224.4V 

From Figure 4.42(b), when I = 78 A 

H = 3.25 Vs/rad 
E 224.4 

W= H = W =69rad/S 

n = ~ = 659 rev/ min 

Connecting a resistance in series with the field and armature windings has reduced the 
motor speed. With the same load torque further increases in the series resistance will 
make the motor run even more slowly, as shown in Figure 4.43(b). 


Series motors can be connected to ac supplies because both the armature and field windings 
have relatively low inductance. They can be used for moderate- or relatively low-power high- 
speed ac applications such as: 

• high-speed blowers for small furnaces 

• vacuum cleaners 

• high-speed woodworking tools (routers). 

These motors can also be used for moderate-power applications requiring good power-to- 
weight ratios and high stall torques such as: 

• power tools, e.g. hand-held power drills and garden tools 

• electric mixers and food processors. 

Since these motors typically run at very high speeds (e.g. 5000—10 000 rev/ min) it is usual to 
use a large reduction drive ratio to enable them to be used to drive items such as a drill bit or 
mixing agitator. A cooling fan is usually incorporated; as well as providing a means of cooling 
the motor, the windage it creates acts as a mechanical load which limits the no-load speed of 
the system to a safe value. 

Compound motors 

There are two forms of compound motor: 

• long-shunt motor 

• short-shunt motor. 

Both motors have two field windings, the shunt-field winding, fQ and the series-field winding, 
Rf s . Figure 4.37(c) and (d) show the winding connections for the long-shunt motor and short- 
shunt motor respectively. In both forms of motor, the magnetic field produced by the shunt 
field winding is larger than that formed by the series winding. Normally, the windings are 
connected so that the two fields assist, i.e. act in the same direction. This is called cumulative 
compound. The current through the shunt winding, and hence the magnetomotive force 
(mmf ) produced by it, are practically independent of the load, whereas, the mmf produced 
by the series field winding depends on the armature current and the mechanical load. When 
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the motor is on no load, the mmf produced by the series winding is negligible and the motor 
performs like a shunt motor. Figure 4.44 shows a typical set of characteristics for a cumulative 
compound motor. As the load current rises the mmf produced by the series field increases, and 
the speed which is inversely proportional to the total mmf, falls. At full load torque the speed 
is between 0.7 and 0.9 of the no-load value. Cumulative compound motors are used where 
heavy loads may be suddenly applied, for example in presses, plunger pumps and hoists. 



Armature current, I a Armature current, I z Speed, n 


DO© 

Figure 4.44 Typical characteristics for a cumulative compound motor: (a) torque vs. 
armature current; (b) speed vs. armature current; (c) torque vs. speed 


The differential compound motor has the series winding connected to produce a field that 
opposes that formed by the shunt winding. This type of motor has few applications and as a 
consequence is rare. 


Separately excited motor 

The separately excited motor has the field winding and armature connected to independent 
voltage supplies, as shown in Figure 4.37(e), from where it can be deduced that: 

Vf = IfRf (4.60) 

and: 


V 3 = E + I 3 R 3 


(4.61) 


The induced emf, E is generally far larger than the voltage drop across the armature winding, 
I a R a . As a result we may ignore the latter and assume that 


E = F a 


But from equation (4.50) for all dc motors 

E = 


Flence in this case 


V a = 

_ 

“ “ fej> 

where k is the design constant and 4> is the flux, which is dependent on the field current, If. 
If, If and hence <)> is kept constant then 

coocF a 


so 


nocK a 

That is, the motor speed is directly proportional to the armature voltage. 
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Worked example 

A separately excited dc motor has the motor held characteristic shown in 
Figure 4.39. The held winding resistance is 200 Q. The armature winding 
resistance is negligible. Calculate the motor speed when: 

(a) the held voltage and the armature voltage both equal 220 V 

(b) the held voltage is 220 V and the armature voltage is 150 V. 


Field current, I { = = 1.1 A 

From motor field characteristic fecj) = 1.8 Vs/rad 

(a) when armature voltage, F a = 220 V 

K 

= 122.3 rad/s 
_ 60to 


2tt 

(b) when armature voltage, F a = 150 V 


= 1168rev/min 


to = Yg = 83.3 rad/s 
n = = 795 rev/min 


Permanent magnet motors 

These motors use permanent field magnets instead of laminated poles with field windings to 
provide the stationary magnetic field, 4>, which is now sensibly constant in value and cannot be 
altered electrically. The analysis and theory of permanent magnet motors is identical to those of 
shunt- wound or separately excited motors with a fixed value of field excitation, so that k4> is 
constant. Their characteristics may be summarized as: 

• no-load speed is proportional to supply voltage 

• torque is proportional to current 

• speed falls from its no-load value with increasing torque. 

Permanent magnet dc motors are not normally used for large power applications but are very 
widely used within domestic and automotive applications such as: 

• window winders, windscreen wipers, fans and blowers in cars 

• battery-operated appliances such as electric shavers and cooling fans 

• battery-operated and track-powered toys. 

They are also widely used in various forms as dc servomotors within control applications. 
Conventional dc servomotors are simply high-quality permanent-magnet dc motors with 
wound armatures and commutators, but there are also variants such as: 

• ironless motors (in which the armature windings form a rigid lightweight structure of their 
own rather than being wound onto iron laminations) 
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• pancake and ‘printed armature’ motors in which the ironless armatures are fiat in shape 

• brushless dc motors in which the armature windings are stationary, the field magnets form a 
rotor, and the switching action of the brushes and commutator is carried out by solid-state 
switches. 

All of these behave electrically in the same manner as the conventional dc permanent magnet 
motor, even though their construction is totally different. The subject of dc permanent magnet 
motors will be revisited within Unit 5 on control. 


Learning summary 

By the end of this section you have learnt: 

/ the operation of dc motors; 

/ to derive the general torque equation; 

/ the different forms of dc motor; 

/ why the shunt motor produces virtually constant speed; 

/ how to vary the speed of a shunt motor; 

/ the characteristics of a series motor; 

/ how to control the speed of a series motor; 

/ why the speed of a separately excited motor is proportional to the armature voltage. 


4.10 Rectified supplies for dc motors 

The vast majority of power supplies around the world are alternating voltage. So to provide 
the power to drive a dc motor it is often necessary to convert the alternating supply to a direct 
voltage. To do this a rectifier is used. 

The principles of rectification were examined in Part 1, Unit 5. 

When an alternating (ac) voltage, Figure 4.45(a), is applied to a diode rectifier with a resistive 
load, like that in Figure 4.45(b), the diode only conducts when it is forward biased (i.e. the 
potential of the anode is positive with respect to the cathode). 

Ohm’s law states that: 


v = iR 

so while the diode is conducting, the load current has a similar waveform to that of the supply 
voltage hut when the diode is reversed biased it cannot conduct and the current remains at 
zero, producing a current waveform that comprises a series of positive half cycles, as shown in 
Figure 4.45(c). 

The voltage across the load resistor is called the output voltage. It has a waveform that has a 
similar shape to that of the load current. Assuming that the forward volt drop across the diode, 
approximately 0.7 V, is small compared to the supply voltage and may be ignored, then the 
average dc output voltage is given by: 

V m 

Average dc output voltage, V Q = 

Where, V m is the supply voltage maximum = \ ,r 2 X rms supply voltage. 
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Figure 4.45 (a) ac supply voltage waveform; (b) rectification using a single diode (half wave 
rectifier) and (c) resulting current waveform; (d) controlled rectification using a single thyristor 
and (e) resulting current waveform; (f) diode bridge rectifier (full wave rectifier) and (g) resulting 
current waveform; (h) thyristor bridge rectifier and (i) resulting current waveform 


Replacing the diode by a thyristor, as in Figure 4.45(d), enables the user to control the output 
voltage. The thyristor current is blocked in the forward direction until the thyristor gate is 
triggered by a small pulse from a second, independent voltage source, as shown in Figure 
4.45(e), thereby reducing the mean load current and hence output voltage to: 

V m 

Average dc output voltage, V 0 = [1 + cos a] 

A disadvantage of both these rectifiers is that current can only flow through the load during 
the positive half cycles of the voltage supply. For load current to flow during both the positive 
and negative voltage half cycles a bridge rectifier must be used. Figure 4.45(f) shows a diode 
bridge rectifier. When the supply voltage is positive, the current flows via diode D, to the load 
and back through D 4 During the negative half cycles D 2 and D 3 conduct, and D, and D 4 are 
cut off. The load current waveform and hence output voltage is shown in Figure 4.45(g). The 
mean voltage is given by: 

2V m 

Average dc output voltage = ^ 

This is twice as large as the output voltage for a single diode rectifier. 
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The corresponding output voltage waveform for a thyristor bridge rectifier (Figure 
4.45(h)) is shown in Figure 4.45(i), and the mean output voltage is given by: 

V m 

Average dc output voltage = [1 + cos a] 

In order to compare the diode and thyristor bridge rectifiers (Figure 4.45(f) and (h)) 
with other rectifier circuits described later in this unit it is useful to redraw the two 
circuit diagrams as in Figure 4.46(a) and (b) respectively. 




Figure 4.46 (a) Diode bridge rectifier; (b) thyristor 
bridge rectifier, both with resistive loads 


Replacing the resistive load by a dc motor affects the load current waveform because 
the motor has inductance. This delays current changes and inhibits the current from 
following direcdy changes in the applied voltage. 

Consider the diode bridge rectifier shown in Figure 4.46(a) hut now with an inductive 
load. 

The supply voltage and load voltage waveforms (Figure 4.47(a) and (b) respectively) are 
similar to those with a resistive load but the load current fluctuates, never falling to zero 
as illustrated in Figure 4. 47(c). The peaks in the current lag the voltage maxima, their 
magnitude dependent on the ratio of load inductance, L, to resistance, R. The larger 
this ratio the smaller is the current ripple, so that with a very high ratio of L:R the load 
current is virtually constant. 

As with a resistive load, diodes D, and D 4 conduct when the supply voltage is 
positive and D 2 and D 3 conduct when it is negative, hut the shape of the diode 
current waveforms are different, see Figure 4.47(d) and (e). Figure 4.47(f) shows the 
corresponding supply current waveform. 

Figure 4.48(a) to (f) show the voltage and current waveforms for a 
thyristor bridge rectifier, Figure 4.46(b), but now with an inductive 
load. Initially, thyristors Th 2 andThj are conducting. 

Because of the high ratio of load inductance to resistance, the load current does not fall 
to zero when the supply voltage reaches zero. Current flow is maintained through the 
two thyristors until the other thyristors are triggered at the angle, a. The output voltage 
will go negative during this period. 


o 


o 


o 


^ £. 


o 

o 



o 


Figure 4.47 (a) ac supply 
voltage waveform; 

(b) voltage waveform after 
full wave rectification; 

(c) resulting current in dc 
motor; (d) current in diodes 
D1 and D4; (e) current in 
diodes D3 and D2; (f) supply 
current 
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The mean output voltage can be found thus: 


Average dc output voltage, 

K = ^[j F m sin(wf)d(«£)] 

y 

= -^[-cos 0)t]£ + “ 

V m 

= — [— cos(tt + a) + cos(a)] 


2V m 

V 0 = cos a 


(4.62) 


The output voltage becomes negative if the delay angle, a, 
is greater than 90°. 

Three-phase bridge rectifiers have six devices and 
produce a smoother dc output voltage than single-phase 
rectifiers. Figure 4.49(a) shows a three-phase diode 
rectifier with a highly inductive load. The phase voltage 
and diode current waveforms are shown respectively in 
Figure 4.49(b) and (c).The diodes conduct in pairs, one 
in the top arm of the bridge with one in the bottom arm 
of another phase; D 5 with D 4 , D, with D 4 , D, with D 6 
and so on. The dc output voltage waveform is shown 
in Figure 4.49(d). The average dc output voltage for a 
three-phase diode bridge rectifier with a highly inductive 
load can be found thus: 


i r 90 ° 

V 0 = J F an sin(o)t)d(wt) — V hn sin(w£ — 120°)d(«£) 




triggered triggered 


o 



Let 

l*Wl Siknl - IVyJ = \V cn \ 


= 3 -^[-cos(o>£) + cos(w£ - 120°)]’°: 
3F m . pl /3 


o 

o 

o 


But 



where V m g ne = maximum line voltage 

= >/2X rms line voltage, Vy me , 

inline = ^ ^line 


(4.63) 



o 

Figure 4.48 (a) ac supply voltage 
waveform; (b) voltage waveform 
from thyristor bridge rectifier; 

(c) resulting current in dc motor; 

(d) current in thyristors Th-, and Th 4 ; 

(e) current in thyristors Th 3 and Th 2 ; 

(f) supply current 
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3-phase supply ^ 



“b 

-t r 



Figure 4.49 (a) Three-phase diode rectifier with a highly inductive load; (b) phase voltage 
waveforms and (c) diode current waveforms; (d) dc output voltage waveform 


Replacing the diodes by thyristors (Figure 4.50(a)) allows the angle at which the devices are 
triggered to be delayed, thereby enabling the output voltage to be controlled. The mean dc 
output voltage, V dc is given by: 


V ic 


3R m ,line 

cos a 

TT 


(4.64) 


where a = delay angle. 



Figure 4.50 (a) Three-phase thyristor rectifier with a highly inductive load; (b) phase voltage 
waveforms; and (c) thyristor current waveforms; (d) dc output voltage waveform for delay angle 
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The mean output voltage is zero when a = 90°, and negative when the angle is greater 
than 90°. As the delay angle is increased, the ripple on the output voltage becomes more 
pronounced. Even so, the output voltage waveform is smoother than for the equivalent single- 
phase rectifier and should be used in preference. A dc motor connected to this type of rectifier 
will perform almost as well as if it were connected to a pure dc supply. Hughes (2008) has 
identified two reasons for this. Firstly, the motor armature inductance smoothes the armature 
current, and since the torque is directly proportional to this current, ripples in the output 
torque are limited. Secondly, the armature inertia helps to inhibit variations in motor speed 
caused by torque ripples. 

Connecting the dc motor as a separately excited motor with the armature fed by a thyristor 
bridge rectifier and the field supplied from an independent diode bridge rectifier, as shown 
in Figure 4.51, enables the motor speed to be controlled by varying the thyristor triggering 
angle, a. 



Figure 4.51 Separately excited dc motor with the armature fed by a thyristor 
bridge rectifier and the field supplied from an independent diode bridge 
rectifier 


It was shown earlier, see equations (4.60)and (4.61), that for a separately excited motor: 

Field voltage, V f = I { Il f 
Armature voltage , F a = E + 7 a R a 
By ignoring the small armature resistance, R . a 

E = V a 

From equation (4.50) for any dc motor: 

Induced emf, E = fecjiG) 

K 

therefore to = 7 ~r 
kq> 

The following example demonstrates that controlling the variation in the trigger angle, alters 
the armature voltage and hence the motor speed. 


Worked example 

A separately excited dc motor has the motor field characteristic shown in 
Figure 4.39. The field winding resistance is 500 fl. The armature winding 
resistance is negligible. A three-phase 415 V (rms) ac supply is connected to 
the motor field winding via a diode bridge rectifier. The same ac supply feeds 
the motor armature via a thyristor bridge rectifier. Calculate the motor speed 
when the thyristor delay angle is: 

(a) 0° 

(b) 45° 
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From equation (4.63) 

Field voltage = average output voltage from diode rectifier 

_ 3F m w 

TT 

where V m ] jne = maximum alternating voltage = \[2 X rms line voltage. 
In this case 

Kniine = Bx 415 = 586.9V 

hence 

V( = 560.5V 

Field current, 

ilf = 5605 = 

I{ R { 500 L1 “ A 

From motor field characteristic, Figure 4.39, when If = 1.12A 
k$ = 1.8 Vs/rad 

Using equation (4.64): 

Armature voltage = average output voltage from thyristor bridge 

T/ 3F m , he 
V, = cos a 

TT 

where a = delay angle in triggering the thyristors 
(a) With the delay angle set at 0° 


From equation (4.50) 


(b) with the delay angle set at 45° 


_ 560.5 _ 


As the average output voltage from the diode rectifier, Vf is constant, If and fecj) 
remain constant at 1.12A and 1.8Vs/rad respectively, the speed falls to: 


. K _ 396.3 


= 220.2 rad/s 


H 1.8 

n = 2102rev/min 

The drop in speed is directly proportional to fall in armature voltage. 
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Learning summary 

By the end of this section you should have learnt: 

/ to draw the current and voltage waveforms for both diode and thyristor bridge rectifiers with an 
inductive load; 

/ to derive the do output voltage for diode and thyristor bridge rectifiers with an inductive load; 

/ to calculate the speed of a separately excited do motor supplied via bridge rectifiers. 


4.11 Inverter-fed induction motors and their 
characteristics 


The construction and operation of a three-phase ac 
induction motor connected to a fixed voltage, fixed 
frequency supply was explained in detail in Part 1, Unit 5. 
It will suffice to repeat the key points here. 

Induction motors are the most common motors 
encountered in industry and are the most rugged type 
of motor. They have two parts: an outer stationary frame 
called the stator and an inner rotating part called the 
rotor. These are shown in Figure 4.52. 

The simplest form of induction motor, the two-pole 
motor, has three separate coils wound in the slots on the 
inner circumference of the stator. The coils are mutually 
displaced by 120°. The rotor has silicon steel laminations 
keyed to a central shaft. Slots are cut in the laminations 
into which aluminium or copper conductors are fitted. 
The ends of the conductors are welded to aluminium end 
rings. There are no electrical connections to the rotor. 

When a balanced three-phase fixed frequency supply is 
connected to the stator windings, currents flow in the 
coils which produce a uniform magnetic field that rotates 
at constant speed in the air gap between the stator and 
the rotor. In the time it takes the currents to complete 
one cycle, the magnetic field makes one revolution of 
the air gap. This is illustrated in the sequence of diagrams, 
Figure 4.53. 


Punched silicon 
steel laminations 



Aluminium or 



to aluminium 
end rings 


Castellations 
on end rings 


Figure 4.52 (a) Stator and (b) rotor of a simple 
(squirrel-cage) induction motor 


The speed at which the field rotates is called the synchronous speed, n s . Measured in 
revolutions per minute, it is related to the frequency of the stator currents,/ in hertz (Hz), as 
follows. 


« s = 60/ 

The rotating magnetic field interacts with the rotor, inducing emfs in the rotor conductors. The 
latter are short-circuited by the end rings so currents flow in the conductors. These currents 
create subsidiary magnetic fields around the conductors. The subsidiary fields interact with the 
rotating stator field to produce forces on the rotor conductors, to create a torque on the rotor 
and make it turn. This torque is called the electrical or driving torque and its magnitude is 
proportional to the magnitude of the rotating magnetic field. 
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t = 1/f, (one cycle after start) 
I R at positive maximum again 



Figure 4.53 (a) Three-phase ac current waveforms; (b)-(e) sequence of energising the windings of 
a two-pole induction motor 


The rotor accelerates until the electrical torque produced by the currents exactly equals the 
mechanical load torque on the shaft. At this point the rotor is running at a speed, n, slightly 
slower than the synchronous speed. This difference in speed is expressed as a ratio and is known 
as the ‘slip’ or ‘per unit slip’, 5. 



The small difference between the speed of the rotating field and that of the rotor is 
fundamental to the operation of the induction motor. 

Figure 4.54 shows a typical torque vs. slip characteristic for a two-pole motor. 

When it is operating in steady-state conditions, an increase in the load torque causes the rotor 
to slow down and the slip to increase. This raises the emfs induced in the rotor conductors and 
the rotor currents, and creates more field distortion and more driving torque. The slip increases 
until the driving torque equals the load torque, and steady conditions are re-established. For 
most motors, the steady state slip varies between around 0.01 on no load and 0.10 when the 
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motor is driving full load. This means that a two-pole motor connected 
to a 50 Hz supply will run at a steady state speed somewhere between 
45 rev/s (2700rev/min) when s = 0.1 and 49.5 rev/s (2970rev/min) 
when s = 0.01. 

For many applications these speeds are too high. Manufacturers can 
reduce the synchronous speed, and hence the rotor speed, by winding 
the stator coils in a way that effectively increases the number of poles in 
the rotating magnetic field. This was demonstrated pictorially in Part 1. 
For example, a four-pole motor, with two N-poles and two S-poles, has 
a synchronous speed equal to half the supply frequency. 

n s = 60//2 = 30/ 



0.75 0.5 0.25 

Slip, s 

0 0.25 n s 0.5 n s 0.75 n s 

Rotor speed, n 


Connected to a 50 Hz supply the synchronous speed is 1500 rev/min, 
and the rotor speed varies between around 1350 rev/min when 5 = 0.1 
and 1485 rev/min when 5 = 0.01. 


Figure 4.54 Typical torque vs. slip 
characteristic for a two-pole motor 


Further increases in the number of poles reduce even more the synchronous speed and with it 
the rotor speed, as demonstrated in Table 4.1. Industrial induction motors are typically rated to 
run at a slip value in the order of 0.04 (4 per cent). 


No. of 
poles 

Pairs of 
poles 

Synchronous 
speed, ns as 
a fraction of 
frequency,/ 

Synchronous 
speed in rev/s 
when 

/= 50 Hz 

Synchronous 
speed in rev/ 
min when 
/= 50 Hz 

Rotor 
speed in 
rev/min 
when 
5 = 0.1 

Rotor 
speed in 
rev/min 
when 
5 = 0.04 

Rotor 
speed in 
rev/min 
when 
5 = 0.01 

2 

i 

/ 

50 

3000 

2700 

2880 

2970 

4 

2 

//2 

25 

1500 

1350 

1440 

1485 

6 

3 

// 3 

16.67 

1000 

900 

960 

990 

8 

4 

//4 

12.5 

750 

675 

720 

742.5 

2 P 

P 

f'P 

50/p 

3000/p 

2700/p 

2880/p 

2970/p 


Table 4.1 


With a fixed number of poles and fixed supply frequency the variation in rotor speed is very 
limited. To operate an induction motor over a wide speed range it must he connected to a 
variable frequency supply. 


Inverters produce variable frequency supplies. They convert direct current 
to alternating current of a chosen frequency. To understand their operation 
you should consider the simple single -phase bridge inverter shown in 
Figure 4.55. Diagonally opposite pairs of transistors conduct in turn. 

When transistors Tr 4 and Tr 4 are switched on and Tr 2 and Tr 3 are off, the 
current flows from the positive dc rail through Tq , the load andTr 4 to 
the negative rail, travelling through the load from left to right. When Tr 2 
and Tr 3 are turned on and Tr 4 and Tr 4 are switched off, the current flows 
through the load in the reverse direction as it travels from the positive dc 
rail to the negative. Diodes are connected in parallel with the transistors 
to short-circuit the large back emfs generated when the magnetic field 
reverses, that would otherwise destroy the transistors as they are turned off. 



Figure 4.55 Single-phase bridge inverter 
circuit 
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By turning each diagonal pair of transistors on for slightly less than half 
a cycle, the load voltage waveform shown in Figure 4.56(a) is created. 
Having a short period with no current at the beginning and end of each 
half cycle reduces the risk of two transistors in the same arm conducting 
simultaneously and short circuiting the dc supply. The output frequency 
is varied by altering the conduction times of the transistors, as illustrated 
in Figure 4.56(b). A more comprehensive description of inverter 
operation can be found in Hughes (2006). 

To control the magnitude of the load current, pulse width modulation 
(PWM) is often used. This involves the removal of a series of notches in 
each half cycle to produce a waveform like that in Figure 4.57(a). One 
way this is achieved is to switch Tr 4 on and off when Tr 4 is conducting 
and to turnTr 2 on and off when Tr 3 is on, taking precautions to avoid 
two transistors on the same arm being switched on simultaneously. The 
mean output voltage is then equal to: 

t on 

mean output voltage = V\= V dc X — y 

where f on = duration of the pulses, and t off = time between pulses 

To alter the mean output voltage the times at which Tr 2 andTr 4 are 
switched on and off are adjusted to change the ratio 


(ton + t oS ) 

In more sophisticated systems, microprocessors are used to vary the 
duration of the pulses and the spaces between pulses. This is shown in 
Figure 4.57(b). In this case, the average output voltage is given by: 


-Periodic time, T— 

o 



Periodic time, T = 7 ~ 7 

frequency,/ 

O 


Figure 4.56 Output waveforms for simple 
bridge inverter circuit (a) giving a low 
frequency and (b) giving a higher frequency 
output 


total duration of pulses in a half cycle 
Mean output voltage = F rfc X duration of half cycle 


Kic X 


Sf on 
£(f on + 


Vi 


For an induction motor supplied by an inverter to operate properly the 
number, width and spacing of the pulses must be chosen with extreme 
care and controlled by a microprocessor to make the load current 
waveform resemble as closely a possible a sine wave. Looking at the 
voltage waveforms in Figure 4.57, this might seem impossible. But, all 
repetitive waveforms with a fixed periodic time comprise an infinite 
series of sine and cosine waves of different magnitudes and frequencies, 
known as a Fourier series. Most textbooks on electric circuit theory 
explain how to determine the Fourier series for any waveform but we 
will limit our discussion to the example of the square wave illustrated in 
Figure 4.58a, for which the Fourier series begins: 

V(t) = -j^-(sinwf + -|sin3a)f + |sin5a)f + ...) where w = 2 t7 f 

The first term in the bracket is a sine wave of the same frequency as the 
square wave. This is called the fundamental. The other sine waves are 
called harmonics. They have smaller magnitudes, and their frequencies 
are multiples of the fundamental. Figure 4.58(b) shows the result of 
adding the first three terms in the series. As more terms are added, the 
waveform becomes closer to a square wave. 



Figure 4.57 PWM output waveforms with (a) 
constant pulse cycle over each half-wave 
and (b) variable pulse cycle 
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Vi 


V dc - 



o 


Original square wave 




Q 


Figure 4.58 (a) Voltage waveform in the form of a square wave; (b) square wave voltage waveform 
approximated by summation of fundamental and first two harmonics; (c) resulting current 
waveform through an inductive load 


The impedance of a motor is a combination of resistance, R and inductive reactance, X L , where 
X L = 2 tt fL 

which is directly proportional to the frequency. As a result, the reactance and hence the 
impedance of the motor are both larger at the harmonic frequencies than at the fundamental 
frequency. So, when a non-sinusoidal voltage, like that shown in Figure 4.58(b) is applied 
to a motor the harmonic components of the current waveform are attenuated more than 
the fundamental, producing a current waveform that more closely resembles a sine wave, as 
illustrated in Figure 4.58(c). 

In drawing these curves it was assumed that the resistance was negligible. 

Even when a pulse-modulated voltage waveform, like that in Figure 4.59(a), is applied, the 
current waveform is approximately sinusoidal, see Figure 4.59(b). The reader needing more 
details is again directed to Hughes (2006). 
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Figure 4.59 (a) Voltage waveform synthesised via pulse width modulation, and (b) resulting 
current waveform through an inductive load 


Three-phase variable frequency inverters are used to 
supply induction motors, as shown in Figure 4.60. Three- 
phase inverters are similar to single-phase inverters. The 
main difference is that the former has three arms to the 
latter’s two. The inverter supplies the currents to the 
motor stator windings to produce a rotating magnetic 
field in the air gap between the stator and rotor. Adjusting 
the frequency of the currents controls the rotational 
speed of the field (i.e. synchronous speed), and hence the 
rotor speed. 

Figure 4.53 showed the stator currents and magnetic field distribution in a two-pole induction 
motor at four points in a cycle. The currents were balanced, and the field had constant 
magnitude and rotated at constant speed. Analysing the current distribution at each instant in 
the cycle (which is beyond the scope of this book) it can be shown that the currents combine 
to produce a magnetomotive force (mmf) of constant magnitude equal to 

mmf = 1 SIN 

where I = stator phase current, N = turns in each phase of the stator winding. 

The magnitude of the mmf is independent of frequency and equals the combined mmf drops 
in the air gap between the stator and rotor, H ajr / alr and in the steel cores, Ff s teei4teei> where H air 
= magnetic field strength in the air gap, L in = total distance flux travels through air, which is 
the length of the air gap between the stator and rotor multiplied by two, as the magnetic field 
has to cross the air gap twice, H steel = magnetic field strength in the steel cores, and l steel = total 
length of the magnetic path in the steel cores. 

The reluctance of the air gaps is greater than that of the steel cores, so it will be assumed the 
latter can be ignored, and the mmf drop in the air gaps (H alr / air ) is directly proportional to 
the mmf produced by the stator currents. 

H im i nr oc IN 

The magnetic flux density, B in air is given by: 

B = |X 0 H ai r 

where |A 0 = constant, 4 tt X 10 -7 H/m 

So the flux density is proportional to the mmf produced by the stator currents, then 

BocIN 

As the air gap between the stator and rotor is uniform it may be concluded that the air gap 
flux, 4>, is also proportional to the mmf produced by the sinusoidal stator currents and will have 
constant magnitude. 



Figure 4.60 Three-phase inverter circuit 
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When the magnetic field rotates it links with the stator windings. Figure 4.53(b) shows the 
situation in a two-pole motor at time, t = 0 when maximum flux links with the red phase coil, 

R-R'. 


The field is rotating at the synchronous speed, n s rev/ min, which related to the frequency,/ by: 
n s = 60/ 

so the angular velocity of the field, to equals: 

to = 2i / 

The flux linkage with a coil, i|t is defined as: 

flux linkage = flux linking a coil X number of turns in the coil 

4» = 4»iV (4.65) 

Taking the maximum flux linkage with the red phase coil R-R' at time, t = 0 to be i|/ m the 
general expression for the flux linkage with the red phase is given by: 

i|/ = v|i m cos c of 

According to Faraday’s law, the emf induced in a coil is: 

d4> 

e = (4.66) 


so the emf induced in the red coil will equal: 

e R = — coi|i m sin(ot (4.67) 

The emfs induced in the yellow and blue coils can be determined similarly. Figure 4.53(c) 
shows that maximum flux linkage with the yellow phase occurs one third of a cycle (or 2 tt/ 3 
radians) later than the red phase, so: 

flux linkages with the yellow phase coil = i|c m cos(co< — 2tt/3) 
and the induced voltage in yellow coil, 

e Y = — coi)j m sin(cot — 2 tt/3) (4.68) 

Maximum flux linkage occurs with the blue phase, two-thirds of a cycle (or 4 tt/ 3 radians) later 
than with the red coil (Figure 4.53(d)), so: 

Flux linkages with the blue phase coil = i|/ m cos(wt — 4 tt/ 3) 
and the induced voltage in blue coil, 

e B = -ui|j m sin (tat ~ 4tt/3) (4.69) 

Thus from equations (4.67), (4.68) and (4.69) the rms value for induced emf in each phase, E 
may be found 



But ca = 2 tt/ and from equation (4.65) i|i m = Nc() m 
so 

2irfN4>« 

h yfl 


(4.71) 


The induced emf in each phase of the stator in an ac induction motor opposes the applied 
phase voltage, V. If the stator winding impedance is considered to be negligible, there will be 
no volt drop in the stator winding and 


\E\ =\V p \. 
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Rearranging equation (4.71) and substituting for E 


Let k = 


constant = 


1 

/2tt N 


. J 2V v 
^ m 2n fN 



(4.72) 

(4.73) 


The magnitude of the rotating magnetic flux is directly proportional to the applied voltage, V p 
and inversely proportional to the frequency,/ To keep the magnitude of the flux constant the 
frequency and the voltage must be adjusted simultaneously (Hughes, 2006). 


As explained earlier, the motor torque is proportional to the magnitude of the rotating 
V p 

magnetic flux. So with <j> m = k~j- (equation (4.73)) we can also say that the torque is also 
proportional to the applied voltage divided by the frequency (V/f). 


When a variable frequency inverter is used to drive an induction motor, the output frequency 
of the inverter is set to produce the required speed of flux rotation in the motor, i.e. 
synchronous speed, and the inverter output voltage adjusted to produce full load torque. 


To alter the motor speed and maintain full load torque the inverter frequency and output 
voltage must be adjusted simultaneously, to keep the V/f ratio, and hence the magnetic flux, 
constant (Hughes, 2006). 


Figure 4.61(a) shows the effect on the motor torque versus speed characteristics of keeping 
(V/f) constant at different frequencies. 




o o 

Figure 4.61 (a) Torque-speed characteristics of induction motor operated at varying frequencies with constant V/f ratio; 
(b) reduced torque above ‘base speed’ where voltage is held constant as frequency increases 


The simultaneous increase in inverter frequency and voltage is only possible until the voltage 
reaches the rated value for the inverter and motor, as specified by the manufacturers. The frequency 
at this point is called the ‘base speed’ and is usually set by manufacturers at 50 Hz or 60 Hz. 

It is unsafe to operate electrical equipment above its rated voltage. If the inverter frequency 
is increased above the base speed, the voltage must be held constant at its rated value and the 
magnitude of the flux allowed to fall. The drop in flux reduces the emfs induced in the rotor 
conductors, the currents flowing through the conductors, the forces on the conductors and hence 
the torque on the rotor. As a result there is a fall in torque at frequencies above the base speed, as 
shown in Figure 4. 61(b). The reader needing more details is again directed to Hughes (2006). 
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In the constant voltage region (i.e. when running at frequencies above the rated frequency 
the torque-speed curve is less steep. Above base speed, the maximum allowable power of the 
motor is constant and the allowable torque reduces inversely with the speed. Hughes (2006) 
presents an interesting discussion on the limits to the torque-speed envelope for induction 
motors. He suggests that the constant power region extends to around twice base speed 
(limited by the peak torque available at a given frequency) and within this region the motor 
may be operated at higher slips than are permitted below base speed. 

All the curves in Figure 4.61(a) and (b) show a small drop in motor speed as the load torque is 
increased in the normal operating range. In situations where this fall in speed is unacceptable, a 
closed-loop speed control system, like that shown in Figure 4.62(a) must be used. 



Desired speed 


o 



o 


Figure 4.62 (a) Closed-loop control of inverter drive; (b) frequency changed by control loop 
to restore motor running speed to desired value n 1 despite torque increase from T-, to T 2 


Such systems compensate automatically for any change in motor speed by raising the inverter 
output frequency as the load torque is increased. Figure 4.62(b) illustrates this. Assume that 
the motor is running on no load, the torque is T\ and the speed is (at operating point 1). 
Without closed-loop speed control an increase in load torque to T 2 will cause the speed to 
fall to « 2 » (operating point 2) but with closed-loop control, the frequency will be increased 
incrementally as the load torque rises to T 2 to maintain the speed constant at ri\ (operating 
point 3). 
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Approximate characteristics of induction motors fed from inverters 

The analysis of an induction motor was covered in Part 1 and the information can be used to 
draw a torque-speed curve such as that shown in Figure 4.54. The non-linear nature of the 
torque-speed equation underlying this curve makes it difficult to use this equation within drive 
system calculations, so it can be convenient to derive approximate torque-speed relationships 
relating to the operating region of the motor as shown in Figure 4. 61. Within this range the 
torque-speed characteristic of the motor is approximately linear. It was shown in equation 
(5.147) in Part 1 that the torque-slip characteristics of a simple three-phase AC induction 
motor are given by the following equation: 

3 p E 2 2 as 

< 4 - 74 > 

where: 

p = number of pairs of poles on stator 
w = 2TT/ = circular frequency of mains supply (rad/ s) 

E 2 is the emf induced in each phase of the rotor winding when the rotor is stationary 
X 2 is the rotor reactance per phase when the rotor is stationary 
a = R 2 /X 2 = ratio of resistance to reactance of rotor winding 

s = per-unit slip = — *— 

where n s and n are the synchronous speed and the running speed of the motor respectively. 

Using equations (5.111), (5.112) and (5.116) from Part 1, the stator phase voltage V\ can be 
expressed as: 



and the rotor standstill reactance per phase, referred tc 
N x 2 


so that equation (4.74) becomes: 


Torque 


3 p V-i 2 as 

"to Xf (a 2 + s 2 ) 


(4.76) 


(4.77) 


Note also that rotor resistance per phase, referred to the stator, is: 


so that a can be re-expressed as 


n 2 2 - 


Ri 


(4.78) 


(4.79) 


As all quantities are henceforward only referred to the stator, the subscripts for V\, X, and // 
can be dropped from equations (4.77) to (4.79). 

The operating region of the motor typically involves slip values in the range 1—10 per cent, 
with the most common value being around 4 per cent. This is around a fifth of the typical 
value of a (which is likely to be around 0.2 or 20 per cent), so that (a 2 + 5 2 ) can be reasonably 
approximated as a 2 . Noting the definitions of a, « and s, and that 
60/ . 

n s = (in rev/ min) 
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Equation (4.77) can be approximated as: 


„ _ ?>P V 2 s 

Torque ~ 


i 3 r iv\ 2 (6Qf 
' 60 2tt R\f) \ p 


Torque 

r, relating the torque instead to the angular velocity of the rotor, w n measured in rad/ s: 


Torque = 


( 7)1 


4tt 2 R\f) \ p 


( 4 . 81 ) 


( 4 . 82 ) 


In practice, nearly all induction motors encountered today are squirrel-cage motors so 
R will be constant but unknown. By adapting an approach described (for the constant- 
frequency situation) by Wildi (1991), it is often easier to infer the approximate torque— speed 
characteristics from the design data (‘nameplate data’) for the motor, involving its rated power, 
P rat ed> rated speed, n rated> and at its rated frequency, / ated , and voltage, V ated , and to write an 
expression roughly equivalent to equation (4.81) as: 


Torque ~ 


I /ated \ 2 / ] 

IA 2 ( 60/ \ 

U a J l 

f)\p n ) 

•//rated f / 

V) 2 f 2 vf \ 

UatedJ l 

7 ) [ 


( 4 . 83 ) 


1 6 Q/rat ed 


(Strictly speaking, equations (4.81) and (4.82) relate to drawing a straight-line tangent 
(at 5 = 0) to the torque-speed curve, while the approach of Wildi (1991) corresponds to 
drawing a straight line or chord between the points on the curve corresponding to zero slip 
and rated slip. In practice, the difference between these lines is small.) 

Using these equations it is now possible to generate the approximately linear regions of the 
families of torque-speed curves given in Figure 4.54 and Figure 4.61: 


• For the single curve shown in Figure 4.54 relating to a motor driven directly from its rated 
power supply/ = / ate d. the approximate torque-speed characteristic is simply given by 
inserting the rated frequency and voltage into equation (4.83), which reduces to: 


Torque = K\ 


' 60 /rate 


( 4 . 85 ) 


• For the curves relating to/ </ ate d as shown in Figure 4.61(a) and the left-hand part of 
Figure 4.61(b),/ and V are varied in proportion, as described earlier within the present 
section, so that their ratio remains constant, and equation (4.83) now reduces to: 

Torque = K [^f ~ nj ( 4 . 86 ) 


• For the curves relating to/> / ate d, as shown in the right-hand part of Figure 4.61(b), the 
voltage remains at the rated value V rMe d while the frequency increases, so equation (4.83) 
now reduces to: 

Torque = K^j (y - ») 
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• For completeness, it can be stated that for the situation where a motor is driven at its rated 
frequency at reduced voltage (for example, via a variable transformer or a so-called ‘soft 
starter’) equation (4.83) reduces to: 


Torque = k( (^ - 


(4.88) 


All of these approximations are valid within the approximately linear region of the motor s 
torque— speed curve, which applies when the actual value of torque lies within the following 
limit (extended from Wildi (1991) to consider varying frequency): 


Torque 


(Rated torque) X | jj 


(4.89) 


where: 


60P rated 

Rated torque = 


(4.90) 


It should be emphasized that this limitation on the applicability of the linearly approximated 
torque-speed characteristic does not necessarily correspond to the limitation on allowable 
torque for the motor. Once again, the reader is referred to Hughes (2006) for a discussion of 
the allowable operating envelope for an inverter-fed motor. 


Worked example 

A 25 kW, 415V (line-to-line voltage), 50 Hz, 1440rev/min squirrel-cage 
induction motor is fed from a variable frequency inverter. The voltage and 
frequency are varied in proportion, up to the rated frequency; the voltage is 
held constant above this frequency. This arrangement is used to drive a blower 
that has a torque— speed characteristic given by L = 5.556 X 10 _5 n 2 where 
torque L is in Nm and n is in rev/min. Determine the inverter frequency and 
line-to-line voltage at blower speeds of 720 and 1560 rev/min. 


For this motor the rated power is 25 000W and the rated speed is n rated = 1440 rev/min. 


It can be seen from Table 4.1 that for a rated speed of 1440 rev/min the synchronous 
speed is almost certainly 1500 rev/min and hence the motor is a four-pole motor with 
p = 2. Therefore: 


1 60 / ra te 


-= 2.763 Nmmin/rev 


For the blower at n = 720 rev/ min: 

Torque = 5.556 X 10“ 5 X 720 2 = 28.8 Nm 


720 rev/min is below the rated speed, so V/f = V nted / f med , and equation (4.86) applies. 
This can be rearranged to give: 


p / Torque \ 
J 60 K +n 


2 / 28.8 
' 60 1 2.763 


+ 720 = 24.35 Hz 




► 
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V= Km 


The rated line voltage is 415 V so the rated phase voltage is 41 5/V3 = 239.6 V 
The voltage is obtained by reducing the rated voltage in proportion to the frequency: 

<£ 

so the line-to-line voltage is 116.7 X V3 = 202.1V 

It is important to check that the approximately linear operating region, and hence the 
applicability of the approximate equation, have not been exceeded. Noting that the rated 
phase voltage is 239.6V and the actual phase voltage is 116.7V, then using equation (4.89): 


//rat 


Maximum permissible torque = (Rated torque) X 

= 165.8 Nm 


sHff 


= 60 X 25 000 / 50 W 116.7 V 

2tt X 1440 1 239. 6/ 124.24/ 


which is very far from being exceeded by the actual torque of 28.8 Nm. 

For the blower at n = 1560rev/min: 

Torque = 5.556 X 10~ 5 X 1560 2 = 135.2 Nm 
1560rev/min is above the rated speed, so V = V nte( b an d equation (4.87) applies. This c 


t be rearranged to give: 


Torque = T = 60 K^y- - Kl^ 


> / , 


Multiplying by / and rearranging gives: 

Lte/ 

Tf - 60 K J -y-f+ Kf mt Jn = 0 

Inserting the values for the present problem gives: 
co 2 

135.2/ - 60 X 2.763 X 2.763 X 50 2 X 1560 = 135.2/ - 207 250/+ 10 775 700 = 0 

This is a quadratic in/ which has the solutions: 

207 250 ± \! 207 250 2 - 4 X 135.2 X 10 775 700 


/=“ 


2 X 135.2 


= 53.9 Hz or 1479 Hz 

As the new running speed is only slightly above the rated speed of 1440 rev/ min, we would 
expect the frequency to be slightly above the rated frequency of 50 Hz, so the first of the 
alternative solutions is clearly the correct one. Recall also that V = F ra ted = 239.6V 

Again it is important to check using equation (4.89) that applicability of the approximate 
equation has not been exceeded: 

Maximum permissible torque = (Rated torque) X 



= 25 000 
2tt X 24 


/ 50 \ 2 / 239.6 \ 2 
1 239. 6/ l 53.9 ) 


= 142.7 Nm 


which is slightly greater than the actual torque of 135.2 Nm so the motor is (by a small 
margin) within its linear region and the calculations are valid. 
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Learning summary 

By the end of this section you should have learnt: 

/ the principles of operation of an induction motor; 

/ the operation of a simple inverter; 

/ the principles of pulse width modulation; 

/ induction motor torque is proportional to the applied voltage divided by frequency [V p /f)-, 

/ at frequencies above ‘base speed’, the torque falls as the frequency increases; 

/ approximately linear torque-speed characteristics of induction motors can be obtained for 
relatively low values of slip. 


4.12 Other sources of power: pneumatics 
and hydraulics 

Electric motors (particularly dc permanent magnet motors for light duties and ac induction 
motors for heavier duties) are by far the most common means of providing drive to mechanical 
systems. However, in certain circumstances it may be appropriate to use other means of 
providing drive, notably the use of pneumatic and hydraulic drives. Electric motors are cheap, 
efficient, reliable and controllable, but they can be heavy in relation to the power needed, they 
can overheat or burn out if stalled, and they are not ideal for providing either very large forces 
or long linear movements. Pneumatic and hydraulic actuators and transmission systems can in 
some situations overcome these problems. 


Pneumatics 

Pneumatic actuators can be used in preference to electrical actuators for the following reasons: 

• better power-to-weight ratio (i.e. less weight for a given power output) 

• better power-to-space ratio (i.e. smaller for a given power output) 

• high torque of pneumatic motor at stall, controlled via pressure and with no damage to the 
motor being caused by stall 

• long stroke of pneumatic cylinder. 

A widely used form of pneumatic motor, used to provide continuous drive, is the vane motor, 
which operates as shown in Figure 4.63. 
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Other types of pneumatic motor are based on pistons. 

The characteristics of pneumatic vane motors can be approximated as a 
straight-line graph joining the high stall torque to the high no-load speed 
(Figure 4.64), typically with a small elbow representing reduced torque on 
start-up. 

A particular advantage of pneumatic motors is that their characteristics can 
easily be altered by: 

• varying the pressure (which changes both the stall torque and the no-load 
speed) as shown in Figure 4.65(a); 

• throttling the inlet or the outlet (which primarily changes the no-load 
speed) as shown in Figure 4.65(b). 



Figure 4.64 Characteristic of 
pneumatic vane motor 



.Effect of constricting 
(throttling) supply 
exhaust 


Speed Speed 

Figure 4.65 Effect of (a) varying the supply pressure and (b) throttling a pneumatic vane motor 


Typical applications of rotary pneumatic motors include: 

• pneumatic wrenches and similar devices for tightening bolts in assembly lines, tyre service 
stations etc. - these exploit both the good power-to-weight ratio and the stall-tolerant 
characteristics of pneumatic motors; 

• bottling plant, where bottle tops need to be tightened, again exploiting the ability of 
pneumatic motors to provide a pressure-limited value of stall torque without damage. 

Pneumatics are also particularly useful for providing linear motion, primarily via pneumatic 
cylinders. These provide moderately large forces over a stroke or displacement ranging from a 
few millimetres up to the order of a metre. When used in conjunction with solenoid-operated 
valves, they provide a means of obtaining rapid linear motion under electrical or computer 
control, and are widely used in automated production environments. The force from a 
pneumatic cylinder is given by: 

F = pA (4.91) 

where F is the force obtained, p is the pressure of the compressed air supply and A is the area 
of the piston. In practice, the available force may be a little lower than this, due to friction 
between the piston seals and the cylinder. 

Pneumatic systems 

In order to be able to use a pneumatic motor or actuator, it is of course necessary to have 
a source of compressed air. There is no pneumatic equivalent of the electricity supply grid, 
although most factories, engineering laboratories etc. are often equipped with a compressed air 
supply, which may be supplied centrally within the factory etc. or via local compressor units. 
While details of different systems vary depending upon size, cost etc., a compressed air system 
typically includes the following components (Figure 4.66): 

• electric motor (or other mechanical power source) to provide the mechanical power to the 
system; 

• several filters, including an inlet filter, to remove airborne particles, which may cause damage 
to the compressor or actuators; 
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• compressor — typically a one- or two-stage reciprocating compressor, although other types 
(e.g. screw compressors) may he used; 

• a control system to ensure that the correct pressure is maintained without excessive wastage 
of energy — this may be a simple pressure switch for switching the motor on and off as 
required, or may involve, for instance, the lifting of the compressor’s inlet valves so that it 
idles without compressing air, or may involve varying the speed of the motor; 

• ‘receiver’ or pressure vessel for storing the compressed air; 

• pressure relief valve, or safety valve, to prevent excessive pressure building up in the event of 
the failure of the control system; 

• network of pipes to carry the compressed air around the factory; 

• oil-mist lubricators to add lubricant to the air for use within motors. 

Larger systems will also include a cooler and/or condenser to allow the dissipation of heat 
generated by the adiabatic compression of air, and to allow the condensation of moisture from 
the air as it cools after compression. An oil/ water separator may also be included to avoid 
contaminating the drains with lubricating oil from the compressor. 

Efficiency and energy utilisation issues of pneumatic systems 

Pneumatic systems and actuators give a more intense, robust and lightweight source of power 
than is achievable with electricity alone, and provide an excellent route to high-speed, low- 
cost automation. However, poor design and working practices can easily lead to greater 
inefficiencies than those inherent in purely electrical systems. The Carbon Trust state that 
‘Of the total energy supplied to a compressor, as little as 8-10 per cent may be converted into 
useful energy that can do work at the point of use’ (Carbon Trust, 2009). Reasons include 
the significant proportion of the energy required for compressing air which is lost as heat, and 
leakage from the air system. It has been stated that ‘35—50 per cent leakage is not uncommon’ 
(Carbon Trust, 2008); other issues include friction in the compressor and air motor, 
pressure drops in filters etc. Careful searches for leaks, elimination of wasteful applications of 
compressed air, and the use of filters with low pressure drops, can all be used to reduce the 
wastage of compressed air and unnecessary use of energy. Although compressed air itself does 
not cause pollution, it is important for engineers to be aware of the scope for inefficiencies, 
and to be aware that the energy needs and associated cost and environmental issues involved in 
air compression can be significant. 
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Figure 4.67 Cutaway view of pneumatic cylinder 


Hydraulic systems 

Hydraulic systems involve the use of liquids under very high pressure, and achieve even greater 
power-to-weight and power-to-space ratios than are achievable via pneumatic systems. There 
are obvious similarities with pneumatics (for example, both involve fluids under pressure, and 
both can be used with cylinder actuators and rotational motors) but there are major differences 
which can be summarized in Table 4.2: 


Characteristic 

Pneumatics 

Hydraulics 

Working fluid 

air 

liquid (usually oil or ester) 

Pressure 

6-8 bar typical 

50-200 bar typical, can go up to 
250-700 bar or higher 

Construction of 
actuators 

lightweight, typically from aluminium 
extrusions and diecastings 

much more substantial, typically 
ferrous castings and steel tubing 

Forces and torques 
available 

medium (tens of N up to around 1 0 kN 
from linear actuators) 

large (e.g. up to 1 0000 kN from linear 
actuators) 

Precision 

low - not usually used for precise 
positioning (though some pneumatic 
control systems provide precise 
control of flow etc. via the use of 
feedback) - more usually used for 
simple two-position operation 

used for precise positioning e.g. within 
machine tools, and under control of 
servo valves 

Stiffness 

not stiff because air is compressible 

very stiff as hydraulic fluid is virtually 
incompressible - actuator becomes 
rigid if inlets/outlets are blocked off, as 
in the case of cranes, jacks etc. 

Cost 

relatively low-cost 

relatively high-cost due to robust 
construction 

Power-to-weight and 
power-to-space ratios 

very good - makes pneumatics 
suitable for power-intensive handheld 
tools, for example wrenches and 
grinders. 

excellent - makes hydraulics suitable 
for providing power locally within 
aerospace applications, for example 
for operating landing gear and control 
surfaces in aircraft 


Table 4.2 


One of the major features of hydraulic systems is the ability to obtain a very large mechanical 
advantage (Figure 4. 68). The classic example is a hydraulic jack, where a reciprocating pump 
with a small bore requires a relatively small force (but many operating strokes) to provide the 
fluid to operate a cylinder of large bore which is used to raise a heavy load by a small distance. 

In the context of mechanical drive systems, a pump with a small displacement (which pumps 
a small amount of fluid per revolution) can be used to generate a very large force or torque 
when connected to a linear or rotational actuator with a large total piston area. 
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Figure 4.68 Mechanical advantage produced by 
simple hydraulic system 


Hydraulic cylinders 

As indicated above, these are similar in concept to pneumatic cylinder 
actuators but operate at much higher pressures and are consequently 
much more solidly built (Figure 4. 69). They may be single-acting or 
double-acting, and provide a large force proportional to the pressure 
supplied to them. They are widely used where a large force is required, 
for example in raising and lowering the buckets of excavators and the 
jibs of cranes. 

Hydraulic motors 

Hydraulic motors are used in a variety of situations where a very 
large torque is required in a given space. Examples include providing 
drive to the tracks of track-laying excavators, driving rollers and 
other large-torque applications in industry, and within variable-speed 
hydraulic drives. Hydraulic motors are typically based on the use of 
pistons, and are generally radial (e.g. Figure 4.70, with the pistons 
typically acting on some form of radial cam) or axial (e.g. Figure 
4.71, with the pistons acting on some form of inclined plate or 
swashplate). 



Figure 4.69 Cutaway view of a typical hydraulic 
cylinder actuator (Design & Draughting 
Solutions Ltd - www.dds-ltd.co.uk) 




o o 

Figure 4.70 (a) Construction and (b) operation of a radial piston 
hydraulic motor (Hagglunds Drives) 
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Because pressure drops due to dynamic head losses are very small in comparison with the 
pressures at which hydraulic systems operate, the torque-speed characteristic of a hydraulic 
motor is very nearly a horizontal line corresponding to a constant torque proportional to 
pressure (Figure 4.72). 

Li 


Figure 4.72 Torque-speed characteristic of hydraulic motor 

Hydraulic power supplies 

In order to power hydraulic actuators\, a pump must be used to provide a source of hydraulic 
fluid at high pressure. This clearly requires a mechanical power source, which is typically 
an electric motor (normally an induction motor) but can be a diesel engine. In a factory or 
laboratory environment a ‘power pack’ is often used, a self-contained unit consisting of the 
following components (Figure 4.73): 

• electric induction motor 

• fixed-displacement pump (e.g. vane pump) 

• oil reservoir 

• pressure regulator (often a relief valve) 

• sometimes a cooler (either a water-cooled heat exchanger, or an 
air-cooled device similar to a car radiator) to dissipate the heat 
generated in the pump and regulator 

• filters to remove particles. 

In other applications it can be convenient to use variable- 
displacement pumps which allow the flow rate from the pump to be 
adjusted directly, for example to allow a piece of machinery (such 
as the wheels or tracks of a piece of construction machinery) to be 
driven with variable speed and direction. 
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Figure 4.73 Hydraulic power pack showing 
75 kW motor and pump, and below them the 
oil storage tank 
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Hydraulic variable-ratio drives 

One convenient application of hydraulics is within variable- 
ratio drives, which behave as a ‘gearbox’ with a ratio which 
is infinitely variable between given positive and negative 
values. The mechanism (Figure 4.74) consists simply of a 
variable-displacement pump driving a fixed-displacement 
motor. With the swashplate on the pump in its neutral 
position, no fluid is pumped, and the motor and output 
shaft do not rotate. However, if the swashplate is moved 
from neutral towards its positive or negative limit, the 
motor and output shaft will rotate at speeds up to the 
maximum rate in either direction. 



o 



Figure 4.74 Diagram of variable-ratio drive providing an 
output rotation: (a) in the forward direction; (b) in the 
reverse direction 


Learning summary 

By the end of this section you should have learnt: 

/ the similarities and differences between pneumatic and hydraulic systems; 

/ why pneumatic or hydraulic actuators might be used in preference to electric motors; 

/ the ancillary equipment needed to power pneumatic and hydraulic systems; 

/ how hydraulics can be used to provide a variable-ratio drive; 

/ the cost and energy-efficiency issues associated with pneumatic power and compressed air. 


4.13 Steady-state operating points and 
matching of loads to power sources 

Consideration of torque-speed characteristics of loads and engines etc. provides an opportunity 
to introduce the concept of the steady-state operating point at which the motor or engine will 
drive the load without acceleration or deceleration. This concept occurs widely in all branches 
of engineering (mechanical, electrical, pneumatic etc.) and is closely related to the concept of 
‘impedance matching’. In practice, it is often necessary to use some kind of transmission to 
match the characteristics of the power source to the load, so that the power source and load 
interact in an optimal manner. 

Steady-state operating points 

The concept of an operating point is straightforward: if the torque-speed characteristics of the 
load and of the motor or engine are both known, then the combination of torque and speed at 
which the motor will drive the load corresponds to the point at which the characteristics cross. 
This is straightforward (and obvious) to determine graphically if the characteristics are plotted; 
it can also be found if the equations of the characteristics are known, simply by expressing 
torque as a function of speed for both characteristics, then equating the two expressions. 
Solution of the resulting equation may or may not be possible analytically, so an iterative 
numerical solution may be required. 
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An issue related to the concept of operating points concerns the scenario if, at any instant, the 
system is running at a speed which does not correspond to the operating point - for example 
if the system is running more slowly than the steady-state running speed. Assuming (for the 
purpose of the argument) that the characteristics are valid for transient (non-steady state) 
conditions, it will be observed that the torque available from the engine or motor exceeds that 
required by the load. In that case, the surplus torque is available for accelerating the load, which 
will speed up until the steady-state running speed is approached asymptotically. 


Worked example 

Example of an internal 

combustion engine problem 

(a) A naturally aspirated 
diesel engine with the 
characteristic given in 
Figure 4.75 is driving a 
pump at 2000 rev/min; the 
pump consumes a power 
of 30 kW at this speed. 

How many litres of fuel are 
required per hour? Assume 
the specific gravity of the 
fuel (effectively, the density 
in kg/litre) is 0.84. 

(b) A sudden flood occurs, 
and the engine is set to 
maximum injection by 
the control system. If the 
combined inertia of the 
pump and engine is 

4 kg.m 2 , at what rate (rev/ sec 
the throttle is opened fully? Yc 



Contours of specific fuel 
consumption (SFC) (g/kWh) 


0 500 1000 1500 2000 2500 3000 3500 

Engine speed (rev/min) 

Figure 4.7E Representative torque-speed-SFC 
map for a diesel engine (adapted from Heywood, 
Internal Combustion Engine Fundamentals, 
McGraw Hill, NY, 1988) 


! ) does the system accelerate at the instant 
u may assume that the engine and pump 
characteristics both remain valid for this sudden change. 

(c) If the pump has a square-law torque-speed characteristic (i.e. torque 

proportional to speed squared) , what is the maximum speed at which the 
engine can drive it? 

(a) Torque = 30000/(2 tt X 2000/60) = 143.2 Nm 
Specific fuel consumption = 245 g/kWh 

Mass of fuel consumed per hour = 30 X 245 = 7350 g = 7.35 kg 
Volume of fuel per hour 7.35/0.84 = 8.75 litres 

(b) Maximum engine torque at 2000 rpm = 400 Nm 
Load torque = 143.2 Nm 

Excess torque for acceleration = 400 — 143.2 = 256.8 Nm 

Acceleration = net torque/moment of inertia = 256.8/4 = 64.2 rad/s 2 = 10.2 rev/s 2 

(c) Assume equation of form L = c N 2 where L is torque in Nm, c is a constant and N is 
the speed in rev/min 

Constant c = initial torque/ (initial speed) 2 = 143.3/2000 2 = 0.000 035 8 
Characteristic of pump: L = 0.000035 8 N 2 (Nm) 

N = 3150 rpm where this characteristic crosses the maximum torque line 
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Matching of loads to power sources 

It very unusual for the characteristics of a power source to match closely the requirements of 
a load without the need for any kind of transmission. One example of where a direct drive is 
possible is the steam railway locomotives (Figure 4.76(a)), where the pistons drive a crankshaft 
that forms one of the main axles carrying the wheels, often with the cranks being integral with 
the wheels. Another is a simple piston-engined aircraft (Figure 4.76(b)), where the propeller is 
mounted directly on the end of the engine s crankshaft. 


o 


o 


Figure 4.76 Two examples of direct drive from engine to 
load (a) chassis of steam locomotive; (b) piston-engined 
aircraft 




Contrast these two examples, respectively, with a modern diesel locomotive, where a complex 
arrangement of generator and motor is used to link engine to wheels, and a ‘turboprop’ aircraft 
such as a military transport plane, where the high-speed gas turbine engine is linked to the 
propeller via a reduction gear. 

Matching of steady-state characteristics 

Probably the most common challenge in matching load to power source is where the steady- 
state running characteristics of the load do not cross the characteristics of the power source in a 
useful manner, for instance where the power source runs too fast and does not provide enough 
torque (see Figure 4.77). 

In such a case it is straightforward to shift the torque— speed characteristic via the use of a 
transmission ratio, as described in Section 4.7, so that a useful operating point can be chosen. 



Figure 4.77 Mismatched torque- 
speed characteristics 



Figure 4.78 Torque-speed characteristics 
which cross at a useful operating point 
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Worked example 

An example of such a problem might be to consider the matching of a load 
such as a blower to a motor (see Figure 4.79). 



Figure 4.79 Blower driven from motor via reduction gear 
Assume that the characteristics of a load are: 

L = 20 4- 0.2co + 0.025(o 2 

where L is the torque in Nm and to is its angular velocity in rad/ s (both 
measured at the shaft directly driving the load). The load is to be driven via 
a 3:1 reduction ratio from a 5 kW induction motor whose characteristics are 
approximated as: 

L' = 5(50tt - (o') 

The raw characteristics of the motor and load do not cross in a useful manner 
(Figure 4.80) 

— Characteristics of load 
— *— Characteristics of motor 


m 

0 50 100 150 200 

Angular velocity (rad/s) 

Figure 4.80 Mismatched characteristics of motor and blower load 

This problem is, of course, tackled by connecting the motor and the load via the 
transmission system illustrated in Figure 4.79. The torque-speed characteristics of the load 
can now referred to the axis of the motor: 

U = U 3 



of = 3o) =h w = (o'/3 
L = 20 + 0.2w + 0.025(o 2 
L' = \ (20 + 0.2u + 0.025(O 2 ) 


= ^20 + 0.2 T + 0.025^ T j j 

= 6.667 + 0.022 22w' + 0.000 9259w' 2 


(4.92) 
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This referred characteristic now crosses the characteristic of the motor to give a stable 
operating point as shown in Figure 4.81 



This operating point can be found graphically as the point where the characteristics of 
power source and load (referred, for instance, to the power source axis) cross. However, 
if (as in the present case) the torque-speed characteristic can be expressed in the form 
of an equation, the operating point can be found by equating the torques for motor and 
load and solving to find the angular velocity. In the present case this solution involves 
constructing and solving a quadratic equation using the standard formula: 

4>ad = Cotor 

6.667 + 0.02222a)' + 0.000 9259co' 2 = 5(50 tt - to') 


(6.667 - 250tt) + (0.02222 + 5) to' + 0.000 9259a/ 2 = 
0.0009259a)' 2 + 5.022 22a)' - 778.7 = 0 


0 


, = -5.02222 ± \j 5.022 22 2 + 4 X 0,000 925 9 X 778/7 
2 X 0.000 9259 

= 150.86 rad/s (or —5575 rad/s, which is clearly incorrect) 

L = 5(50tt - 150.86) = 31.1 Nm (4.93) 




Matching of starting characteristics 

Another common situation is where it is desired to start a load from rest. If the 
load does not have a large inertia, or if the mechanical power source has a large 
torque at zero speed (and is not damaged by stalling or running at low speed), 
this may not be a problem. However, in many practical situations, starting can 
be a problem. For example, internal combustion engines will not run in a stable 
manner at low speed (they stall easily, as any learner driver knows!), while large 
electric motors will burn out if they are required to provide a large torque at low 
speed for any length of time. Several strategies can be adopted: 

• alteration of the characteristics of the mechanical power source, for example, 
by driving an induction motor from an inverter so that it gives a good torque 
as the frequency is raised from rest (see, for example, Figure 4.82). (Note: this 
diagram assumes that the frequency is increased slowly so that the steady- 
state operating points are reached at each value of speed. For rapid ramping 
of frequency, the torque required to cause acceleration will, of course, he 
significant and the locus of combinations of torque and speed will not 
coincide with the steady-state torque-speed curve of the load.); 



Figure 4.82 Locus of operating 
points as a combined friction and 
windage load is accelerated slowly 
from rest using an inverter-driven 
induction motor 
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• a clutch used to allow a mismatch in speed between mechanical power source 
and load, whilst allowing a given value of torque (equal to the slippage torque 
of the clutch) to be transmitted. The power source can now run at a stable 
speed while the load accelerates from rest under the influence of that torque. 

Examples include: 

— the process of getting a car moving from rest, allowing the clutch to slip 
gently as the speed of the car picks up 

— the engagement of a clutch on a lathe to start the spindle (which may 
carry a heavy workpiece with a large value of inertia), having earlier started 
the lathe’s induction motor under no-load conditions; 

• the use of variable-ratio drives (for example hydraulic variable-ratio drives) to 
provide a means of running up the device from rest; 

• choosing a mechanical power source that has a torque-speed characteristic 
which is well suited to the differing demands of starting and running. This 
involves a high torque at low speed (without damage to the power source or 
any ancillary equipment, e.g. power supply, occurring due to running under 
stall conditions) combined with lower torques at high speeds. Good examples 
of motors with these characteristics (see Figure 4.83) include: 
o pneumatic motors 
o hydraulic motors 
o series-wound electric motors. 

Learning summary 

By the end of this section you should have learnt: 

/ how mechanical power sources and loads interact; 

/ how a transmission system may be used to match the power source to the load; 

/ to calculate the combination of torque and speed at which a load and power source will operate 
under steady-state conditions; 

/ the function of a clutch with particular reference to starting a mechanical load from rest; 

/ why some sources of mechanical power are better suited to starting of loads than others. 



characteristics of motors which are 
well suited to starting 
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Feedback and control 
theory 


UNIT OVERVIEW 

■ Introduction 

■ Feedback and the concept of control engineering 

■ Illustrations of modelling and block diagram concepts 

■ The s domain: a notation borrowed from mathematics 

■ Block diagrams and the 5 notation: the heater controller and tensioning system 

■ Working with transfer functions and the 5 domain 

■ Building a block diagram: part 1 

■ Building a block diagram: part 2 

■ Conversion of the block diagram to the transfer function of the system 

■ Handling block diagrams with overlapping control loops 

■ The control algorithm and proportional-integral-derivative (PID) control 

■ Response and stability of control systems 

■ A framework for mapping the response of control systems: the root locus method 


5.1 Introduction 

There are numerous situations in engineering where it is desirable to provide power to a 
machine in order to make it run at a chosen (and variable) speed, flow rate, pressure etc. In 
many of these cases, there is a need to maintain the speed or other attribute of a system at 
a fixed value despite changes to the environment in which it operates. Examples including 
maintaining the speed of a car at a fixed value regardless of road gradient and wind, maintaining 
the course of an aircraft regardless of the motion of the air in which the aircraft flies, and 
adjusting the rotational speed of a DVD to maintain the required data rate regardless of the 
changing radius at which the laser beam reads the disc. 

This unit initially concentrates on the concept of feedback and control, and on the 
mathematical concepts used in modelling a control system. The basic concepts are outlined 
with a minimum of mathematical overhead, then the mathematical tools required to analyse 
realistic situations (such as those involving dynamics) are introduced one by one until systems 
can be modelled, analysed and characterized in terms of their performance and stability. 

The concept of automatic control is exemplified by systems such as the cruise control on a 
vehicle, which maintains a set speed regardless of gradient, wind speed etc., and the autopilot 
on an aircraft, which maintains course, altitude and speed without human intervention. 


317 


An Introduction to Mechanical Engineering: Part 2 


However, although control systems are now present in numerous everyday situations, they are 
frequently so invisible or transparent in their operation that the user may be totally unaware 
of them. For instance, modern car engines continuously take data from sensors to adjust the 
engine to optimum performance regardless of ambient temperature, televisions maintain 
stable pictures despite fluctuations in supply voltage and frequency, and cameras automatically 
compensate for varying levels of illumination. By contrast, 30 to 40 years ago, cars were 
equipped with a manually operated choke to compensate for a low engine temperature, 
televisions frequently needed adjustment by the user to prevent the picture from breaking up, 
and cameras needed manual setting of focus, shutter and aperture. The present unit aims to 
introduce some of the concepts involved in automatic control, beginning with a simplified 
approach involving a minimum of mathematics in order to examine: 

• feedback 

• block diagrams 

• error 

• the control algorithm. 

The unit then progresses to examining some of the mathematical tools of classical control 
theory that are required to analyse and understand real control systems. These tools include: 

• the concept of Laplace transforms to allow analysis of systems within ‘s space’ 

• the concept of the transfer function 

• analysis of initial and final behaviour and frequency response 

• techniques for determining stability of a control system 

• a technique for visualizing the behaviour of a system. 

This unit can only give an overview of some of the simpler topics in control engineering, 
and the approach taken here is to present a variety of worked examples rather than going into 
excessive depth on the theory or techniques. Numerous texts are available that cover control 
engineering in more detail and from a more rigorous mathematical perspective including the 
current edition of the classic text by Dorf and Bishop (2008). A slightly earlier but excellent 
text is that by Van deVegte (1993). 


5.2 Feedback and the concept of control 
engineering 

Concept, history and simple examples 

Feedback may broadly be defined as the comparison of a measured physical variable 
with the desired value of that variable with the aim of taking action to minimize 
the difference between the measured and desired values. 

The original application of feedback in control of engineering plant appears to have been the 
centrifugal governor originally applied to steam power by James Watt in 1788. As the governor 
rotates and the weights fly outwards, a steam valve is closed so that engine speed is maintained 
at an approximately constant value. In fact, Watt s development of the steam engine governor 
was based closely on the use of a similar device invented a year earlier by the millwright 
Thomas Mead for the control of the settings (but not necessarily the speed) of grinding 
wheels in windmills (Bennett, 1979). Figure 5.1(a) shows one of the governors on what is 
thought to be the last set of engines built by James Watt and Co. in 1882-4, still in working 
order in the preserved Papplewick Pumping Station near Nottingham. At the time of writing, 
there is a similar engine (also with a centrifugal governor) in working order in Wollaton Park 
immediately to the north of the University of Nottingham. 
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0 ^ Governor is driven by engine at 

4$’ a speed proportional to engine speed 



governor 


Figure 5.1 (a) Governor at Papplewick pumping station; (b) schematic operation of typical centrifugal governor 
(Papplewick Pumping Station Trust) 


The process by which the governor operates (Figure 5.1(b)) is: 

• if the engine speeds up beyond its desired speed, the masses on the governor move 
outwards under centrifugal action; 

• the movement of the masses is transferred, via a collar, to a control rod that slightly closes a 
control valve supplying steam to the engine; 

• this causes the engine to slow down; 

• by means of this process, known as negative feedback (for example, where an increase 
in speed results in an action which causes the speed to be reduced) the engine s speed is 
automatically maintained at a value close to the desired value. 

Conversely, if the engine speed falls unexpectedly, the governor will cause the control valve to 
open slightly above its current setting, causing the engine to speed up. 

Since the development of the centrifugal governor, the understanding of control systems has 
developed to incorporate a variety of concepts and theories. These include the application 
to control of Laplace transforms, proportional-integral-derivative (PID) control, transfer 
functions, stability, root locus plots etc. These developed into the discipline we now know 
as classical control theory in the years during and after World War II.The history of the 
discipline is covered in depth in two books by Bennett (1979 and 1993); these topics form the 
basis of the control-related material in the present unit. 

Everyday example of closed-loop feedback control: driving a vehicle at con- 
stant speed 

The task of maintaining the constant speed of a vehicle provides an excellent example of 
closed-loop control. In the case of a car without cruise control (or with the cruise control not 
engaged) one aspect of the driving process is: 

• a speed is chosen (for example, the driver might choose to drive as fast as possible to reach 
an appointment in time but without breaking the law) — this is taken to be the ‘demand’ for 
the system; 

• the current speed is measured via the speedometer, which is observed by the driver; 

• the driver compares the two and assesses the difference (known technically as the error, 
which is the desired value minus the measured value); 
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• equipped with the skill of how to drive a car, the driver takes a decision on how to correct 
the speed based on the value of error, and also based on an understanding of how well the 
speed is converging to the desired value - this decision-making process is very important 
even though it is probably taken for granted within a human control situation; 

• the driver attempts to correct the speed by raising or lowering his foot on the accelerator 
(gas) pedal; 

• depending on issues such as the mass of the car, the power of the engine etc., the car will 
respond by gradually increasing or decreasing its speed; if the driver is experienced, the car’s 
speed will rapidly converge on the desired value, hut if the driver is a learner, the speed is 
very likely to oscillate about the desired value until the driver hits upon the correct position 
of the accelerator. 

This process is illustrated via a simple block diagram in Figure 5.2. 


Corrective action 



Figure 5.2 Block diagram of the process of driving a car 


An obvious but essential point to note is that feedback is negative: for example, if the car 
is travelling too fast, the driver partially releases the accelerator to allow it to slow down. 
(Positive feedback is explored later in this section but would not be helpful in this situation — 
it would involve the driver pressing the accelerator ever harder as the car’s speed was seen to 
increase!) 

This process can of course be automated, and a system for doing so (known as the cruise 
control) is available in some modern cars. The control loop (Figure 5.3) remains essentially 
the same as before, except that the decision-making process requiring the skill of the driver is 
now replaced with the automated decision-making process (known formally as the control 
algorithm) forming the heart of the cruise control system, and the actuation of the engine’s 
throttle takes place under the command of the cruise control rather than via the driver’s 
operation of the pedal. 


Desired speed Cruise control 



Figure 5.3 Block diagram of a cruise control system (greatly simplified) 


The control algorithm is carefully designed to take account of the way in which the car’s speed 
is changing, in order to avoid over- or under-shooting the desired speed, or causing the speed 
to fluctuate about its desired value. 
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Less constructive examples of feedback 

The fact that control systems generally employ negative feedback has already been noted — for 
instance, if a car is going too fast, the corrective action involves sending a signal to slow the 
car down. Although positive feedback is occasionally used to beneficial effect, it is generally 
an undesirable phenomenon, as any discrepancy between desired and actual values of system 
behaviour results in a signal that makes that discrepancy greater, giving a form of instability. A 
powerful example of the destructive nature of positive feedback is the situation (a ‘bank run’) 
which occurs when a bank starts to encounter ‘liquidity problems’, i.e. when it starts to run out 


• customers begin to hear that the bank is short of cash and become worried about the 
security of their savings; 

• they therefore withdraw their savings from the bank so that they can keep them safely 
elsewhere; 

• this means that the bank has less cash, and more customers get worried about its problems; 

• even more customers withdraw their savings . . . 

. . . and so on until either the bank fails completely (as happened numerous times in the 
1930s) or the process is halted by state intervention (as in the case of Northern Rock in the 
UK in 2007) . Other examples include the phenomenon in electronics known as thermal 
runaway, which relates to a device that turns electrical power into waste heat as a by-product 
of its normal operation, and tends to do this more vigorously as its temperature rises. Unless 
precautions are taken, the amount of power it converts to heat will keep increasing as its 
temperature rises, in turn causing it to get hotter still, until the component burns out. An even 
more catastrophic example was the Chernobyl nuclear disaster, which can be regarded as a 
complex form of thermal runaway. The more the cooling water in the nuclear reactor boiled 
to form steam, the more heat the nuclear reaction generated. This created yet more steam, 
eventually causing a disastrous explosion. 


5.3 Illustrations of modelling and block 
diagram concepts 

Simple example of feedback and control: modelling of a heated 
system 


Real control systems usually include some form of time-dependent behaviour, but it is 
useful to illustrate the modelling process via a very simple control system in which time- 
varying effects are neglected. Consider a heater unit for a set of railway equipment, designed 
to maintain the equipment at an elevated temperature to avoid it icing up. The railway 
equipment has an exposed surface area of A = 0.5 m 2 and a convective heat transfer coefficient 
of h = 5 W/m 2 .°C (more correctly represented as h = 5W/m 2 .K).A heater provides a 
power p of heat input into the railway equipment. It will be assumed for simplicity that the 
temperature of the system’s surroundings is 0°C. It is easy to show that if (for example) p = 
10W and the system is allowed to settle down to a steady temperature t, the temperature 
achieved (which we may regard as the output of the system) is obtained by equating the power 
input p to the rate of heat loss: 


of money: 


p(t) = Ahj(t) 


(5.1) 


This is solved to obtain 



(5.2) 
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which, for the above values, gives a temperature of: 

t « = 0TV5 = 4 ° c I 5 - 3 ' 

Note that the values that are properties of the system (A and h) are constant, i.e. invariant 
with time, while all variable quantities p and t are expressed in the form of functions of 
time t, though in this example we will assume that such quantities have had time to settle 
down to their steady-state values, or that that the thermal capacity of the system is negligibly 
small. (The importance of this apparendy obvious statement will become clear later as a 
slightly different convention is normally followed in control engineering: variable quantities 
will be expressed as a function of a complex variable s, rather than time; see section 5.4.) 

This very simple relationship between power input and temperature may be (trivially) 
represented using a block diagram as shown in Figure 5.4. 

In the block diagram, the mathematical relationship between an input (in this case, power p) 
and an output (actual temperature t) is shown as a ‘block’ which acts upon ‘flows’ of values or 
data, specifically multiplying the input by the value of the constant contained in the block in 
order to obtain the output. 

Note: In Figure 5.4 the fact that the variables p and t are functions of time is deliberately not 
shown. In fact we would never normally draw the block diagram directly with variables as 
a function of time; they would instead be expressed as a function of the complex variable s 
which will be introduced in Section 5.4. 



Figure 5.4 


Suppose that we now wish to maintain the temperature of the equipment at a given value 
Tf)(t). In order to automate this process, a very simple control system is introduced which drives 
the heater with a power that is proportional to the difference between the desired temperature 
and the actual temperature, so that for each degree Celsius of temperature difference, an extra 
power of 10W will be supplied where this constant of proportionally will be termed the gain K 
of the controller. As will be explained later, this is a very simple form of the control algorithm 
(decision-making process) and is known as proportional control. It may he represented 
mathematically as: 


p{t) = lC(T D (f) - T(f)) 


(5.4) 


and within a block diagram as shown in Figure 5.5. 



Figure 5.5 


The block diagram in Figure 5.5 now involves not only the input setting t d (demand 
temperature), the output t (actual temperature) but now also an intermediate quantity t d — t, 
known as the ‘error’. Also visible is a ‘differencing junction’ where t is subtracted from T D .The 
block diagram for the complete control system can now be assembled by joining the loose 
ends’ in the block diagrams for the different aspects of the system as shown in Figure 5.6. 



Figure 5.6 


Note that, because we wish to treat t as an output from the system as well as feeding it back, 
a branch is introduced into the flow; no additional symbols are involved in this. The block 
diagram can be simplified slightly by combining successive blocks as shown in Figure 5.7. 
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Figure 5.7 


Assembling the block diagram corresponds 

P(t): 

T (f) = Ah K (^(t) ~ t ( f)) = 


o combining equations (5.2) and (5.4) 

JC(T D (f) - T (t)) _ Kj D (t) Kft) 

Ah Ah Ah 


eliminate 

(5.5) 


It would appear that we have a circular relationship in which t (t) appears twice. However, 
equation (5.5) can be rearranged to group the t ( f) terms together: 


T(f) + 


m) 

Ah 




KJn(t) 

Ah 


(5.6) 


It is now straightforward 
(demand) T D (i): 


rearrange this to give the output T (t) in terms of the input 
Kr D (t) 


T (t) - 


_K\ (Ah + K) 


T D (<) 


so that the error is: 

Td« - T (t) = T u (t) ~ ^^T D (f) = ^f^T D (t) 


(5.7) 


(5.8) 


The block diagram for the whole system can now be expressed in simplified form in 
Figure 5.8, where the contents of the ‘box’ are the ratio of the output to the input. This 
concept will be extended within the more complex analysis given in Section 5.4 and 
5.5 and will be referred to as the transfer function. 

The issues of manipulating and simplifying block diagrams will be explored in detail in 
Section 5.9. It is important to note that, with this simple (proportional) control algorithm, 
the desired temperature is never actually reached. For example, with the above choice of gain 
K = 10 W/°C, the temperature t (t) settles down to the following value: 



To 


Figure 5.8 


= (0.5 X^T 10) Tr ^ = °' 8td ^ 


(5.9) 


so that a demand temperature of 5°C will result in an actual temperature of 4°C. Increasing the 
gain will result in the temperature being more closely approached; for example a gain of 
K = 100W/°C will result in a temperature of 

T(f) = (Q.5 xTt 100) Tp(t) = °' 976 t d(0 ( 5 - 10 ) 


being achieved, for example 4.88°C for a demand of 5°C. It will be seen later, however, that 
there can be a penalty (in terms of system stability) associated with making the gain of a system 
larger than necessary. 


Example of feedback and control: modelling of a fibre tensioning 
system 

Another introductory example involves a slightly more complex control system, again with 
time-related terms such as inertia being neglected, and involves only mechanical components. 
Consider a purely mechanical tension control system for a filament winding machine (used for 
producing fibre-reinforced composite containers) in which the dry fibres are held on a 
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Figure 5.9: Diagram of mechanical fibre tensioning system 
(frictional forces omitted for clarity) 


cardboard spool of radius r, braked via a drum of radius R and coefficient of friction |x with a 
force related to the fibre tension / r and to a pre-load setting^ applied to the braking system 
(Figure 5. 9). The concept is that the brake provides a frictional force on the drum in order to 
tension the fibre, the pre-load contributes to the frictional force (and can he adjusted to vary 
the tension), and the tension feedback pulley provides a mechanism for reducing the braking 
force if the fibre tension becomes too large. In this system, the pre-load setting^, is the input to 
the system, and the fibre tension /, is the output. 


The block diagram, and hence the relationship (transfer function) between input and output 
f r can be derived by considering each part of the system in turn. 


By taking moments about the fulcrum O in a free-body diagram of the 
brake lever and pulley (Figure 5.10), the forced on the brake shoe is 
found from: 

2/(2/ x (f) -/ P (f)) + = 0 

f B (t) = 2/ P (t) - 4/ T (f) (5.11) 

or Ut) = 2(/ P (f) - 2 / t (0) 

Again we note that the length l, and for that matter quantities such as 
R, r and |x, are all constant, i.e. invariant with time, while all variable 
quantities f B ,fr and f B are expressed in the form of functions of time 
t. In the present version of the example we will assume that such 
quantities only change very slowly, and/ or that the inertia of the 
rotating components is negligibly small. The behaviour of the brake 
lever and pulley is represented as a block diagram in Figure 5.11. 



Figure 5.11 Block diagram for lever and pulley 



Figure 5.10 Lever and brake assembly 
(braking friction is assumed to have 
negligible moment and is omitted for clarity; 
reaction forces at O also omitted) 


In the block diagram, the mathematical relationships between quantities such as the input 
settingup, the output f r (fibre tension) and intermediate quantities such as brake force are 
shown as ‘blocks’ that act on ‘flows’ of values or data. In this particular block diagram, blocks 
involving multiplication by constant factors (gains) of value 2 and 4 are evident. Also visible is a 
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‘differencing junction’ where 4/ r is subtracted from 2 f v to give ^3. Again, the 
fact that/p,/ r and/ B are functions of time is deliberately not shown; the correct 
representation of time- dependent quantities within block diagrams via the 
complex variable 5 will be introduced in Section 5.4. 

Provided that inertia effects are ignored, or that no acceleration or deceleration 
is occurring, the brake drum (Figure 5.12) is similarly analysed as follows: 





Figure 5.12 Brake drum 


Mt) 


p. R 


(5.12) 


This analysis can be represented via a block diagram (Figure 5.13). 


By virtue of the feedback inherent in the system, there is a circular relationship involving 
/ B and j Y since the application of the braking forced results in an increase in the fibre 
tension / r , but the lever system results in /, acting to reduce the braking force / B . This 
simple system could be solved algebraically without further ado, hut it is instructive first 
to draw the complete block diagram for the system, in which the feeding-back of f r for 
eventual subtraction from/p is now clearly visible in Figure 5.14. 


/b 

|xR 

It t 


r 



Figure 5.13 Block diagram 
for brake drum 



Figure 5.14 Block diagram for fibre tensioning system 


Figure 5.14 may be rearranged to bring the leftmost block inside the loop as shown in Figure 
5.15, and may be further simplified into the form shown in Figure 5.16 by combining the 
transfer functions of successive blocks by multiplication. 



Figure 5.15 Block diagram of tensioning system after rearrangement 



Figure 5.16 Block diagram of tensioning tensioning system represented 

system after multiplying successive blocks using a single transfer function 


The system may now be solved as follows: 

ix R ix R ix R 

m = 2 ^(/ P (t) - 2 / x ( 0 ) = 2^-m - 4 
M)(l + 4^f) = 

|xR 

2 ~T 2|xR 

/t(0 ~ L + 4 i^) m ~ ( r + 4p*R) fv{t) 
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In this case the ratio f\ (t)/fp(t) of fibre tension to pre-load force (effectively the ratio of output 
force to input force) is simply 2|jL-R/(r + 4|jlR), and a block diagram which represents this is 
shown in Figure 5.17. 

The effectiveness of the control provided by this system can now be observed for the following 
parameters: r = 50 mm, R = 100 mm, p, = 0.4, F P = 2.5 N.A 20% reduction in r (to 40 mm) 
as the spool of fibre unwinds results only in a 5% increase in tension from 0.95 N to 1 N — 
without the feedback system, the change in tension would be around 20%. 

If we wish to consider the inertia of the spool and brake assembly (which would be significant, 
and would need to be included in the analysis in order to understand the tension in the fibre) 
then the simple (time domain) mathematics described above is insufficient to express the 
situation, and an additional tool is required to take account of rates of change within a system. 
This will be described in the next section. 


Learning summary 

By the end of this section you should have learnt: 

/ the concepts of closed-loop control and feedback, and their typical applications; 

/ the difference between positive and negative feedback; 

/ the concept of a block diagram representing the characteristics of a control system and the 
manner in which feedback is provided. 


5.4 The s domain: a notation borrowed 
from mathematics 

It is very difficult to analyse anything except the most elementary systems (such as the ones 
described above) in the time domain. This is because practical systems involve features such 
as inertia and damping, which require derivatives such as acceleration and velocity to be 
incorporated into the model. In such a case, the block diagram effectively becomes a graphical 
representation of a differential equation rather than a simple algebraic one. In order to tackle 
such systems, analysis within classical control engineering is generally undertaken using an 
adaptation of the methods of solving differential equations, using a notation based on one of 
two complex variables: 

— the variable known as s, involving analysis in the s domain, used for modelling continuous 
control situations such as the ones covered in this chapter. This notation is closely related to 
the mathematical approach involving Laplace transforms, which are just a useful way of 
solving differential equations by converting them to polynomials in s ; 

— a different complex variable, known as z, involving analysis in the z domain, used for 
modelling discrete and digital control situations. This will not be considered further in the 
present chapter, but will be of interest in the case of more advanced control situations. 

Within the present chapter, the behaviour of control systems and their components, including 
modes of behaviour that involve derivatives and integrals with respect to time, will henceforth 
be expressed in terms of the variable 5. The concept of the s domain, the meaning of 5 as 
a variable, and the relationship to Laplace transforms, can all be discouraging or confusing 
to engineers whose main concern is to solve practical control problems. It is therefore 
convenient here to view the variable s and the 5 domain as a language or notation for processes 
involving integration and differentiation, rather than becoming too concerned about either 
the physical meaning of 5 or its use as a tool for solving differential equations; there are 
numerous introductions to the technique in the literature, such as Chapter 6 of the engineering 


326 


Feedback and control theory 


mathematics text of Kreyszig (2006). For completeness, however, it can be stated that the 
Laplace transform F(s) = !£{f(t)} of a time-domain function/(f) is defined as: 

m = 2 {/«} = l e-jmt (5.i4) 

where s may be interpreted as complex angular frequency, or radians per unit time. By 
convention, as shown in equation (5.14), variable quantities expressed in the time domain (i.e. 
as a function of time) are denoted by lowercase letters, e.g./(f), while the quantities expressed in 
the s domain are denoted by upper-case letters, e.g. F(s). 

Laplace transforms of a large variety of functions are tabulated in the mathematical literature 
(Kreyszig, 2006, pp 264-7; Healey, 1967). For the purpose of the present chapter, Table 5.1 
may be considered to be the most useful: 


Function/(f) (valid only for 1 3= 0; 

Graph 

Laplace transform F(s) 

assumed to by zero for t < 0) 




1 

Unit step function (Heaviside 
function) H(t): 

/(f) = 0forf<0 
f(t) = 1 for t > 0 

(Note: definitions vary for the value of 
/(f) for t = 0) 

M 


1 




2 

Unit ramp function fH(f): 
/(f) = 0 for f « 0 
/(f) = f for f > 0 

m. 


i 

/ 


* 

3 

Unit impulse function (Dirac delta 
function) 8(f): 

/(f) = 0 for f < 0 
/(f) = oo for t = 0 
/(f) = 0 for f > 0 
(area enclosed is unity) 

1 


i 



4 


M. 

i 

i 

s + a 

— 


5 

1 - e~ at 



KfVd) 



6 

f--j(1 - e-“) 

M 

y 

s 2 (s + a) 


’ 


Continued overleaf 
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Table 5.1: Laplace transforms of some commonly encountered functions 


Laplace transforms also obey various rules which include the following: 

(1) Laplace transforms are linear, that is they obey the principle of superposition. For example: 

wco + km = zwm + %Mt) i 

= + b%{g(t)} = aF(s) + bG(s) 

In particular it can be seen from this example that: 

• the original functions f(t) and^(t) are multiplied by the constants a and b respectively, 
and so are their transforms F(s) and G(s); 

• the transform of the sum of af(t) and bg(t) is the sum of their transforms. 

Similarly, the transform of a difference, e.g. af(t) — bg(t) would be aF(s) — bG(s). However, 
the unwary student should note that this concept does not extend to the situation where 
functions of t are multiplied together: the transform of f(t)g(t) is not the product of their 
transforms, as this does not involve superposition or linear combination. 


(2) The Laplace transform of a derivative is 




\ mn 

{ d t I 


= s£{f(t)} -/( 0) 


(5.16) 
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so that, in turn, the transform of a second derivative is 

%{^f\ = s 5 £ {^f] _/,(0) = ^ m} _/,(0) " 5/(0) (5 - 17) 

d /(£) 

where /'(0) is the value of . at £ = 0. 


(3) The Laplace transform of an integral is 


^/ Q /(T)d T } {/(£)} (5.18) 

(4) The Laplace transform of a time-shifted function, where the time axis has been changed 
by a constant value t, is as follows: 

if = m then S{f<f - T)} = e-”F(s) (5.19) 


Of particular interest here is the observation that multiplication by s (and subtraction of the 
initial condition, which in a control system is typically zero) represents differentiation, while 
division by s represents integration. This ‘shorthand’ for these calculus operations is encountered 
often in dealing with control systems. For readers familiar with the ‘operator D’ method of 
solving differential equations (Kreyszig, 2006, pp. 59—61; Spencer et al, 1977, pp. 17—18 and 41), 
there are significant similarities between the operator D (where multiplication by D represents 
differentiation and division by D represents integration) and the way in which differentiation and 
integration in the time domain are represented respectively as multiplication and division by s. 


The procedure for using Laplace transforms to solve ordinary differential equations is: 


(1) with knowledge of the initial conditions, the differential equation is transformed from the 
time domain to the s domain; 

(2) by use of partial fractions or otherwise, the transformed equation is simplified to give the 
solution in the s domain; 

(3) the simplified equation is transformed back into the time domain to give the solution (in 
control engineering, this step is usually not necessary and the behaviour of the system is 
generally left as an expression in s). 


A straightforward example illustrates the power of Laplace transforms 
in solving differential equations. A system (Figure 5.18) consisting of 
a spring of stiffness k in parallel with a viscous damper of constant c 
is subjected to a constant force p(t) = q beginning at time £ = 0. For 
traceability throughout the following derivation, the forcing term is 
highlighted in green, the spring term in red and the damping term 
in blue, up to the point where the equations are rearranged and the 
individual terms are no longer identifiable. 

It will be assumed that the link joining the spring to the damper 
has no significant mass. Applying equilibrium to the link: 

dx(£) 

Pit) = kx(t) + c~- 


— CK- 


Figure 5.18 Spring-damper system 


At 


£ = 0,x = 0 
p(t) = 0 for £ < 0 
p(t) = q for £ > 0 


(5.20) 


Mathematically this load can be expressed in terms of the Heaviside step function h(t) (see item 
1 in Table 5.1): 

p(t) = qH(t) (5.21) 


329 


An Introduction to Mechanical Engineering: Part 2 


C d t 


so that the equation of motion now becomes 

qH(t) = kx(t) H 
Taking Laplace transforms (see equation (5.14)) gives 

- = kX{s) + csX(s) - cx{ 0) 
= (k + cs)X(s) ~ cx( 0) 


(5.23) 


Rearrangement gives 


oc(0) 


w s(k + a) k H 

Using partial fractions this can be expressed as 


cv(0) 


w sk k(k + cs) k + cs 

Inserting the initial condition that x(0) = 0, and rearranging, gives 

q i n i 

*(,)=7 


k(s+ k/c) 


(5.25) 


From items 1 in Table 5.1, it is seen that the inverse transform of 1/s is 1 for t > 0, and 
similarly the inverse transform of 1/(5 + a) is e~ at for t > 0; these items are of the same form 
as the terms in equation (5.25). Assuming that a = k/c and inserting these inverse transforms 
into equation (5.25) gives the well-known solution to the problem of a spring-damper system 
under a step load: 


= t 1 -e 


(5.26) 


Instead of using partial fractions, it is often possible to make use of more complicated inverse 
transforms to get straight to the answer. For example, equation (5.25) can be expressed as: 


X ^ = T 


k \ 5(5 + k/c) ) 

n 5 in Table 5.1, it is seen that the inverse transform of 


5(5 + a) 

is 1 — e~ at 

Assuming as before that a = k/c and substituting this inverse transform into equation (5.27) 
once again recovers the solution equation (5.26), this time without any further working being 
required. 


Learning summary 

By the end of this section you should have learnt: 

/ the concept of the Laplace transform and its role in the solution of differential equations; 

/ the basic rules relating to application of Laplace transforms to problems involving integration, 
differentiation and time-shifting; 

/ how to transform a range of expressions from the time domain into the 5 domain. 

^ J 
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5.5 Block diagrams and the s notation: the 
heater controller and tensioning system 
examples revisited 

Heater controller 

The real strength of the 5 notation, when applied to control engineering, comes when it is used 
in conjunction with block diagrams to model systems whose behaviour involves differentiation 
or integration with respect to time. Revisiting the simple example of the heated system 
modelled earlier, consider the heated equipment to have a mass of m = 50 kg and a specific heat 
capacity of C p = 500J/kg K. Now, it is assumed that the heating power supplied and the rate of 
heat loss are no longer equal (so the system is no longer in thermal equilibrium), and the net 
rate of heat gain causes the temperature to increase: 

dT(f) 

p{t) - AhT(t) = mC p -g- (5.28) 


Assuming initial conditions (i.e. assuming that T (t) is initially zero), and making use of equation 
(5.16), the Laplace transform of this expression is 

P(s) ~ AhT{s) = mC p sT{s) (5.29) 


This can be rearranged as 

_ v P(s) - AhT(s) 
T{S) mC„s 


(5.30) 


The block diagram (Figure 5.19) for this aspect of the system is now slightly more complex 
than before and shows the circular relationship inherent in equation (5.29), in which T(s) is 
defined via a function involving T(s) itself. 



Figure 5.19 Block diagram relating to 
heat balance of heating system 


This circular relationship can be eliminated from the mathematics by straightforward 
manipulation of equation (5.30): 

AhTjs) mCpSTjs) + AhTjs) 
mC„s 


Tis) H 


mC p s 
mC„s + Ah 


mC p s 

> T (s)=~ 


T ,,_ m 

{) mC„s 


TP(s) 


(5.31) 


We could use this to simplify the block diagram relating to this aspect of the problem, but 
for illustrative purposes the loop in the block diagram will be left in place and will instead be 
tackled within Section 5.9. 


The controller still follows the same approach as in the simpler version of the problem, i.e. 
the power supplied to heat the equipment is proportional to the error, which is the difference 
between desired and achieved temperatures: 
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p(t) = K(T D (t) - T(f)) (5.32) 

This expression is very straightforward to transform to the s domain: 

P(s) = K(T d (s) - T(s)) (5.33) 


and is represented by substantially the same block diagram as before 
(Figure 5.20). Compare this with Figure 5.5; it is now (correctly) expressed 
in terms of the complex variable s. 

Assembly of the complete block diagram now results in a diagram which 
involves two nested loops (Figure 5.21). 

This block diagram will be simplified to a straightforward transfer function 
(ratio of output to input in the s domain) in Section 5.9. 



Figure 5.20 Block diagram relating to 
controller 



Figure 5.21 Complete block diagram of heater controller system 


Fibre tensioning system 


Revisiting also the fibre tensioning system example, now consider the spool/brake drum 
assembly to have inertia J. Also, consider that the fibre is being drawn through the system at a 
velocity u(t) (which is equal to r0(t)) and acceleration du(t)/dt, so that the angular acceleration 
0(0 of the drum is 


0(0 = 


i <MQ 

r dt 


(5.34) 


The analysis starts in almost the same manner as the static analysis of the system undertaken 
earlier. Equation (5.11) is still applicable: 


/ b (0 = 2(/ P (0 - 2 / x ( 0 ) 


(5.35) 


Equation (5.35) straightforwardly transforms to the 5 domain to give 


F b (s) = 2(F P (j) - 2 F r (s)) 


F b (s) = 2F v (s) ~ 4 F r (s) 


(5.36) 


The block diagram for the lever is now unchanged from the version in Figure 5.11 other than 
being expressed in 5 space (Figure 5.22). 



Figure 5.22 Block diagram for lever system 


However, introducing inertia into equation (5.12) results in an expression of the form L = /0: 


/ x (0r - p/ b (()R =J0 = 


(5.37) 
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Rearranging gives 


| xR 

fr(t) = “~/b(0 


J du(t) 
r d t 


(5.38) 


Making use of equation (5.16), the Laplace transform of this expression (assuming that u(t) and 
du(t)/dt are both zero initially, i.e. at t = 0) is now 

| xR J 

Ft(s) = ^~Fb(s) + s J ~U(s) (5.39) 

The block diagram (Figure 5.23) for the brake/ drum assembly is therefore somewhat more 
complex than before and now involves a summing junction. 



Figure 5.23 Block diagram of brake/ 
drum assembly including inertia 


By combining the block diagrams for the lever and for the brake, and joining the arrows 
relating to F B (t) and F x (f), the overall block diagram for the system can now be assembled in 
Figure 5.24. 



Figure 5.24 Block diagram of tensioning system 


It will no longer be possible to express the transfer function purely as a ratio of output to input, 
as effectively there are now two inputs to the system (pre-load f B (t) and filament speed u(t)). 

By way of introduction to the manipulation and simplification of block diagrams, note 
that proceeding along the main flow of the block diagram leads to the following (circular) 
expression, involving F-] (s) on both sides of the equation: 

ixR sj 

F t (s) = (2F v (s) - 4 F r (s))~ + U(s )f 

IX R L iR sj 

= 2Fp(s) - 4F t (5) + U(s) f (5.40) 
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Rearranging to take F T ( s) to the left-hand side of the equation gives 


Multiplying throughout by r gives 

F r {s)(r + 4pjR) = 2F p (s) ( jlR + U(s)sJ 
Finally, dividing throughout by (f + 4p.R) gives the transfer function: 


F r (s) = 


2(jt R 

(; r + 4|X R) 


F P (i) H 


(r + 4|xR) 


U(s) 


(5.41) 

(5.42) 

(5.43) 


Learning summary n 

By the end of this section you should have learnt: 

/ the use of block diagrams involving the 5 domain; 

/ the construction of the block diagram of a system involving dynamic behaviour such as 
mechanical or thermal inertia. 

V ) 


5.6 Working with transfer functions 
and the s domain 


Open- and closed-loop transfer functions 

It has already been mentioned in passing that a transfer function gives the relationship 
between the output and input. In general, the transfer function of a simple system (e.g. that 
given in Figure 5.25) is expressed as a function of 5 relating the Laplace transform Y(s) of the 
output y(t) of a system divided by the Laplace transform X(s) of the input x(t), for example: 



Figure 5.25 A block 
diagram representing a 
transfer function G(s) 


G(s) = 


X(s) 


(5.44) 


While this is straightforward for a simple system having a block diagram 
consisting of a single box, practical control systems are more complex and, 
like the example of the tensioning system, nearly always involve some form 
of (usually negative) feedback. Such systems typically have a block diagram of 
the form shown in Figure 5.26. 

In such a system: 

• G(s) typically represents the transfer function of the item of equipment 
being controlled, along with the transfer function of the controller; 


Figure 5.26 A typical feedback loop 


• H(s) typically represents the transfer function of the sensor or transducer being used to 
measure the output of the system and compare it with the input or demand signal. 


Two versions of the transfer function for this system can he defined: 

• Open-loop transfer function. There are two conventions for this, which can be confusing 
for the unwary student. For the typical system shown in Figure 5.26, the open-loop transfer 
function is sometimes defined simply as G(s), the relationship between output Y(s) and 
input X(s) for the condition that the feedback does not actually take place (Figure 5.27). Flowever, 
the open-loop transfer function is also sometimes defined as the relationship between the 
input and the value which would be subtracted from the input, again for the condition that 
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no feedback actually takes place. In other words, it is sometimes 
defined as G(s)H(s), the product of the transfer functions of the 
two boxes through which the signal passes on its way back to 
the differencing junction: 

The latter version of the open-loop transfer function is of particular 
interest in predicting the stability of a control system from 
experimental measurements without actually causing the control 
system to go unstable, and is the one which will be used later in 
this chapter. 



Figure 5.27 A control system operating in open 
loop mode 


• Closed-loop transfer function. This gives the relationship between the output of the closed- 
loop control system (Figure 5.26) and its input. The closed-loop transfer function can be 
derived by noting that: 


Y(s) = G( S )[X(s) - H(s)Y(s)] 


(5.45) 


Rearranging gives 

Y(s) + G(s)H(s) Y(s) = Y(5)[1 + G(s)H(s)] = G(s)X(s) (5.46) 

and hence 


Y(s) _ G(s) 

X(s) 1 + G(s)H(s) 


(5.47) 


which is the closed-loop transfer function of the system, the ratio of output Y(s) to input 
X(s) in the s domain. The closed-loop transfer function is of particular interest in terms of 
understanding the error of the system (how the actual output from the system compares 
with the desired output). 

In order to determine the behaviour of the output of the system, two further mathematical 
tools are needed: the initial-value theorem and the final-value theorem. 


Initial-value theorem 

It is convenient to assume within classical control modelling that the initial state of a 
given control system is known. In other words, the starting point for future behaviour is 
unambiguously defined. It can usually be assumed that everything is initially at rest and in its 
‘neutral’ position (i.e. all values defining the state are zero). It can be useful to check that the 
transfer function, in conjunction with the inputs to the system, correctly calculates this state. 
This theorem may be stated as: 

lim ft) = lim sF(s) (5.48) 

Note that this finds the value of f(t) immediately after time t = 0 — in other words, an 
infinitesimal amount of time is allowed for the analysis to begin and for inputs to be ‘switched 
on’ (via the unit step and unit impulse functions given in items 1 and 3 of Table 5.1, for 
example). 

As an example, consider the spring— damper system described earlier, consisting of a spring of 
stiffness k and a damper of constant c in parallel, subjected to a load which is zero for t < 0 and 
of value q for t > 0. Mathematically this load can be expressed in terms of the Heaviside step 
function H(t): 

p(t) = qH(t) (5.49) 

The Laplace transform of the Heaviside step function is given in row 1 of Table 5.1, so that 
equation 5.49 transforms to: 


P(s) = 


(5.50) 
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From earlier work it was shown (equation (5.25)) that the displacement of the spring-damper 
system under this load of q applied at t = 0 can he expressed as 


X(s) = 


± 

sk 



(5.51) 


Inserting this expression into the initial-value theorem (equation (5.48)) gives 

I q q \ isq sq 

lim x(t) = lim sX(s) = lim s f~r j = lim I — r 


(5.52) 


ooXc 
~T = 0 


Note that the very large terms involving 5 swamp the smaller terms such as k/c, so that the 
latter may be neglected; the zero initial displacement of the system is correctly recovered. 

Now consider a simplified version of this system, which omits the damper. In this case, the 
equation describing the spring is very simple: 


kx{t) = p(t)-x( 0) = 0 

Dividing by k, taking Laplace transforms and noting the zero initial conditions gives 
X(S) = |P(5) 


(5.53) 


(5.54) 


Once again the load is represented as a Heaviside step function (item 1 in Table 5.1): 
p(t) = qH(t) 


so that 


P(s) 


q 


(5.55) 

(5.56) 


Combining these expressions gives the expression for displacement under the applied load: 


m = 


ks 


± 

sk 


(5.57) 


Inserting this into the initial-value theorem correctly recovers the displacement of the spring 
immediately after application of the load: 


lim x(t) = lim sX(s) = lim : 


1 

k 


(5.58) 


It could usefully be argued that this example is unrealistic as the displacement has occurred 
instantaneously on application of the load; in practice, of course, there would be some inertia, 
but that is not modelled here. 
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Final-value theorem 

Although the initial-value theorem provides a useful check that initial conditions are satisfied, a 
potentially more useful tool is the final-value theorem, which is a method of calculating the 
long-term value of the response of a system. The theorem may be stated as 


hm /({) = lim sF(s) ( 5 . 59 ) 

Note that this is only applicable where the final value is constant and finite. 

A straightforward example is the calculation of the displacement of the spring-damper system 
after it has had a chance to settle down. Recall that 


X(s) = 


± 

sk 



( 5 . 60 ) 


Inserting this expression into the final-value theorem correctly recovers the steady-state 
displacement of the system: 


lim x(t) = lim sX(s) = Hm s / — ^ 


! 

k 


( 5 . 61 ) 


A more sophisticated example will be given in Section 5.12. 


Learning summary 

By the end of this section you should have learnt: 

/ to simplify a block diagram containing a feedback loop to give a simplified block diagram involving 
a single transfer function; 

/ to find the values of an expression in the 5 domain at times t = 0 and t = oo. 


5.7 Building a block diagram: part 1 

The strength of the techniques of control modelling lies in the fact that models of complete 
systems can be assembled from the building blocks consisting of the models of individual 
components. For each component, it is necessary to understand how the output and input are 
related (in the time domain). This relationship, often in the form of a differential equation, is 
then transformed into the s domain, and is then rearranged to give the transfer function of that 
component. Some typical examples of components follow. 


Mass 


The transfer function giving the displacement of a mass in terms of a force applied to it is 
obtained by transforming Newton’s second law: 

d 2 x(t) 

M = ma(t) = ( 5 . 62 ) 

Using equation (5.17) and taking Laplace transforms gives 

F(s) = ms 2 X(s) — mx'(0) — msx( 0) ( 5 . 63 ) 
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where x'(0) is the value of dx/dt at t = 0. If zero initial conditions are assumed (i.e. 
displacement and velocity are zero up to the instant when t = 0), this simplifies to 

F(s) = tns 2 X(s) ( 5 . 64 ) 

This may be represented in block diagram form as shown in Figure 5.28. 


In this context (i.e. for this form of the block diagram) the transfer function of the 
mass is therefore the force divided by the displacement (where both are expressed as 
functions ofs): 


X(s) 


( 5 . 65 ) 




F(s) | 





Figure 5.28 Block diagram 
for a mass 


Sometimes, it is more convenient to consider the conversion to acceleration and the effect of 
the mass as separate ‘boxes’ in the block diagram as shown in Figure 5.29. 



Figure 5.29 Alternative block diagram for a mass 


Alternatively, equation (5.65) may be rearranged to give the displacement caused by a particular 
force: 


m = 

(5.66) 




The resulting block diagram is shown in Figure 5.30. 


X{s) , 

i 

F® , 

In this context, the transfer function is now: 



ms 2 


X(s) i 

F(s) ms 2 

(5.67) 

Figure 5.30 Another block 
diagram for a mass 


Spring 


The mathematical model of a spring is a very straightforward statement of proportionality 

(Hooke’s law): 


/(f) = kx(t) 

( 5 . 68 ) 

Taking Laplace transforms gives an equally simple result: 


F(s) = kX(s) 

( 5 . 69 ) 

so that the block diagram of a spring giving the displacement ii 
be drawn (Figure 5.31). 

n terms of the force can 

The transfer function is 


II 

( 5 . 70 ) 



Figure 5.31 Block diagram 
for a spring 


Conversely, the displacement may be expressed in terms of the force by simple rearrangement: 



II 

( 5 . 71 ) 




so that the transfer function is now 



F(s) f 

| 

X(s) f 


X(s) =1 

( 5 . 72 ) 


k 



Figure 5.32 Alternative block 

The resulting block diagram is shown in Figure 5.32. diagram for a spring 
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Dashpot or damper 

A dashpot or damper is a component that gives a reaction force proportional tc 

a the velocity 

v(t) of one end relative to the other. It is described mathematically as follows: 

dx(t) 

/(f) = Cl>(t) = 

(5.73) 

Taking Laplace transforms: 

F(s) = csX{s) - cx{ 0) 

(5.74) 

Assuming zero initial conditions, and rearranging, gives the transfer function 
F(s) 

X(s) * 

(5.75) 

or alternatively 

X(s) 1 

F(s) cs 

(5.76) 


The block diagrams for these two alternative representations are shown in Figure 5.33(a) and (b): 


X(f) t 


F(s) , 






cs 


Figure 5.33 Alternative block diagrams for a damper 


Flywheel or rotational inertia 

Just as Newtons law for linear motion relates force to acceleration via the equation F = ma, 
the torque l applied to a flywheel is related to that flywheels angular acceleration a 
(= 0 = w) by the following equation, which also involves the moment of inertia J, the 
rotational equivalent of mass: 

d 2 0(f) 

m =Mt) = je =J~^r (5-77) 


(Note that /(f) is used for torque rather that L(f) to he consistent with our notation that 
lowercase italic letters represent variables in the time domain, while uppercase italic letters 
represent variables transformed to the s domain.) 


Taking Laplace transforms by applying equation (5.16) to equation (5.77) gives the following 
equation in the s domain: 

m =Js 2 Q(s) -J0'( 0) ->0(0) (5.78) 


where 0(s) = >{0(f) }, i.e. the Laplace transform of the angular position 0(f). Setting initial 
conditions (angular position 0(0) and angular velocity 0'(O)) to zero, and dividing by 0(s) to 
rearrange the equation, gives the transfer function: 


L(s) _ 

0(s) JS 


(5.79) 


This can be represented via the block diagram shown in Figure 5.34. 



J 2 

m , 





Figure 5.34 Block diagram 
for an inertia load 
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Alternatively, the same concept may be expressed in terms of angular velocity oo(t): 


dco(t) 

m =M<) => = J^T 


Applying equation (5.16) gives: 

L(s) =JsCl(s) -Jco(O) 


Again assuming initial conditions, this gives: 


L(s) 

il(s) 


= js 


(5.82) 


where il(s) is the Laplace transform of angular velocity oo(f).The block diagram is 
shown in Figure 5.35. 


Rotational load with viscous characteristics 


This is the rotational equivalent of a dashpot, and again may be represented in terms 
of the angular position or angular velocity: 

d0(f) 

l(t) = to® = 


This gives the transfer function 


L(s) 


b 


(5.80) 

(5.81) 




L(i) t 


]s 



Figure 5.35 Alternative block 
diagram for a rotational 
inertia load 


(5.83) 


(5.84) 


DC motor 


0(5) ' 


(5.85) 


For steady-state purposes, the characteristics of a motor are very simple: torque /(f) is 
proportional to current i(t) supplied: 


giving the transfer function 


m = Kj( t ) 


I(s) ' 


(5.86) 

(5.87) 


It should be noted that a constant magnetic field is assumed within the motor (for instance, 
assuming the motor is of the permanent magnet type), and hence it is possible to use the single 
constant K m which is equivalent to the term fe(() used within Unit 4. As a first approximation 
(and especially under no-load conditions), the angular velocity u>(t) is proportional to the 
motor supply voltage v(t): 


«(f) = 


K m 


(5.88) 


Note that, for an ideal motor, the same constant K m appears in both equations (5.86) and 
(5.88). Equation (5.88) leads to the transfer function 


•Q(s) = j_ 

V(s) K m 


(5.89) 


A more sophisticated model of the motor takes account of the fact that the speed under load 
conditions depends on both the supply voltage v(t) and the torque l(t) which the motor is 
providing: 


w(t) 


vifl 

K m 


im 

K 2 m 


(5.90) 
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where R is the armature resistance of the motor. Note that, strictly, the torque 
in this expression includes any component of torque used for acceleration of 
the motor armature as well as any provided as the useful mechanical output 
from the motor. 

It is usual to plot torque vs. speed rather than vice versa, so Equation (5.90) 
may be represented as a set of torque vs. speed characteristics for a range of 
supply voltages (Figure 5.36). 

The Laplace transform of equation (5.90) (for zero initial conditions) 

L(s)R 


„ V{s) 


kJ 


simply 

(5.91) 


m = e ( 5 ) — -. 0 ( 5 )-^- 

so that the block diagram can now be drawn as in Figure 5.38. 



This situation can be represented via a block diagram (Figure 5.37). 

Alternatively the torque available for driving and/ or accelerating the load (and 
the armature) can be represented in terms of the speed: 


Figure 5.36 Typical set of torque- 
speed characteristics for a dc motor 


(5.92) 


Qfr). 


KJ 


Amplifiers and other components with gain 

The situation is frequently encountered in control engineering where a signal 
is amplified (made larger) using some form of amplifier. An everyday example 
is the amplifier used in a home entertainment system, which takes a small 
signal (from a DVD player, MP3 player etc.) and makes it much larger in order 
to drive the loudspeakers. In simple terms, an amplifier multiplies a signal (e.g. 
a voltage) by a given constant K and therefore involves a transfer function 
which can be expressed as 

VUs) 

VM 

The concept of a gain or constant of proportionality can be extended to 
numerous other situations, including sensors with calibration constants. A good 
example is a tachogenerator, which is a speed sensing device that produces a dc voltage output 
signal proportional to angular velocity. Such a device will have a certain gain in volts per unit 
of angular velocity. Other examples are simple components such as resistors (which allow a 
current to flow in proportion to voltage). A simple proportional gain (where the error signal is 
multiplied by a constant to give the corrective action) will form one possible component of the 
control algorithm to be examined in Section 5.11. 


Figure 5.37 Block diagram for a dc 
motor 




L6) > 

K m 


K 

(5.93) 


R 

4\ 


Figure 5.38 Alternative block 
diagram for a dc motor 


Pumps and tanks 


These components are widely encountered within courses on control theory as they relate 
to quantities (flow rate, liquid level) that are easily visible. The behaviour of a positive- 
displacement pump is very straightforward as a first approximation: the volume flow 
rate of fluid q(t) delivered is proportional to the angular velocity w(t) with a constant of 
proportionality fC, giving the transfer function 


Q(s) 

ft(5) ' 


(5.94) 


The behaviour of a tank requires 


an understanding of its differential equation: 


4in« - 4out(<) = A 


d h(t) 
At 


(5.95) 


341 


An Introduction to Mechanical Engineering: Part 2 


where h(t) is the liquid level, A is the cross-sectional area of the tank, 
and q m {t) and q OM (t) are respectively the rates of inflow to, and outflow 
from, the tank (Figure 5.39). 

This results in the following expression in the s domain: 

Q in (s) - Q 0 Js) = AsH(s) - Ah( 0) (5.96) 

which, for zero initial conditions, can be represented as 

m = ^(QinW - QoutW) (5.97) 

This can be represented in block diagram form (Figure 5.40). 



Figure 5.40 Block diagram of tank 



Figure 5.39 Water tank showing inflow Q in (f) 
and outflow Q out (t) 


Flow restrictions and linearization 

Components such as pipe bends, orifices, valves etc. are often encountered in control systems 
that involve fluid flow. It would be tempting to treat such flow restrictions as being analogous 
to resistors, until it is realized that, strictly speaking, they are not linear devices. For example, an 
orifice plate or valve has a pressure drop Ap(f) across it that is approximately proportional to the 
square of the volume flow rate q(t): 

A Pit) = Cm 2 (5-98) 

where C is a constant depending upon the nature of the flow restriction and the density of 
the fluid. It would appear superficially that it is impossible to represent such a component 
within the present (linear) framework. However, in practice, many control systems operate 
over a fairly narrow range of speeds, depths, flow rates etc., and it is possible to simplify the 
system’s behaviour by considering small perturbations about a given operating point. This results 
in behaviour that can be assumed to be linear within a small range around the operating point. 
For the present (simplified) purposes, it is sufficient to assume that the flow rate q(t) from a tank 
via a flow restriction is proportional to the height (‘head’) h(t) of liquid above the restriction: 

h(t) 

q(t) = (5.99) 


where R is the linearized flow resistance of the restriction referred to the head of liquid. Taking 
Laplace transforms results in the following expression: 


Q(*) = 


H(s) 

R 


(5.100) 


where H(s) = %{h(t)} 

In general, linearization is used to cope with all manner of situations where the true behaviour 
of a device is non-linear, but where small perturbations around an operating point are to be 
considered. Examples might include 

• the relationship between liquid depth h (head) and flow rate q, described above (Figure 
5.41(a)); 

• the relationship between torque and speed of a centrifugal blower (Figure 5.41(b)). 
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Figure 5.41 Linearization of (a) flowrate-head characteristics for a tank; 
(b) torque-speed characteristics for a blower 


Summing and differencing junctions 

Within control systems, the effects of inputs and other influences are often combined via 
summing and differencing junctions; such junctions have already been used freely within 
the earlier examples. As another example, two flows may sum to form a larger flow (Figure 
5. 42(a)). The summing of the flows is represented via summing junction in a block diagram 
(Figure 5.42(b)). 

o 




Figure 5.42 (a) Summation of two flows at a pipe junction; (b) block 
diagram of pipe junction represented as a summing junction 



A very frequently encountered situation in control loops is the concept of negative 
feedback where the measured value of a signal is compared with the desired value, and 
the difference between them is known as the error. This has already been encountered 
in the examples given above. In the introductory example of the cruise control, the 
actual speed v(t) may be compared with the desired or demand speed v n (t), so that, 
in the 5 domain, the feedback quantity V(s) is subtracted from the demand V n (s) to 
obtain the error signal E(s) (Figure 5.43). 



Figure 5.43 Typical differencing 
junction 


Transfer function of a controller 

In order to correct the error consisting of the difference between desired and actual values 
of the system response, some form of controller is used. The detailed behaviour (and transfer 
function) of typical systems will be explored in Section 5.1 1, but for the present time it 
will be represented simply by the symbol G c (s) (Figure 5.44). 









Figure 5.44 Block diagram 
of a controller 


Learning summary 

By the end of this section you should have learnt: 

/ the concept of a transfer function; 

/ the transfer functions of a range of simple components; 
/ the concept of linearization. 
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5.8 Building a block diagram: part 2 

By this stage, the reader should have sufficient information to be able to construct the transfer 
functions (and hence draw the block diagrams) for each of the components. The remaining 
challenge lies in correctly assembling the diagram for the whole system from its components. 

In constructing block diagrams, it is conventional to assume that all variable quantities or signals 
(such as displacement, speed etc.) are functions of s rather than time. Although each component 
may itself be trivial to model, the construction of the block diagram of a system may require 
careful thought, and it is not always obvious which way around a particular relationship (such as 
the relationship between torque and speed of a motor) should be applied. 



Voltage signal giving feedback of position 


Figure 5.45 Over-simplistic positioning system 


Two examples of block diagram construction have already been presented within Sections 5.3 
and 5.5, but it is instructive to explore the assembly of the block diagram components from 
Section 5.7 via two further examples of varying complexity. A simple (and flawed) example 
is a positioning system (Figure 5.45) consisting of a motor with constant K m driving a load 
of inertia J, with the motor being driven via current amplifier that provides the motor with a 
current of K :l amperes per volt of input. Position is assumed to be sensed via a device that gives 
an output of K s volts per radian of rotation of the motor. The input to the system is a demand 
voltage v n (t) which is proportional to the desired position. The voltage, tq>, corresponding to 
the actual position is subtracted from f D to give an error voltage, which is amplified by the 
servo-amplifier to give the input current i m to the motor which will create a torque to rotate 
the load with the aim of correcting the positioning error: 


« m (0 = FQMf) - u P (f)) 

Taking Laplace transforms and assuming that all voltages etc. are 

Us) = K(Vd(s) ~ V P (s)) 

This may be represented within a block diagram (Figure 5.46). 


litially zero: 


Figure 5.46 Block diagram of differencing 
junction and amplifier for positioning 


The motor will be assumed to be acting as a source of torque /(f) dependent 
upon the current i(f) by adapting equations (5.86) and (5.87) to the present 

system 



situation and rearranging so that 

Us) [ 


L(5) t 

m = K m I m (s) (5.101) 




The block diagram corresponding to this is straightforward and is shown 
in Figure 5.47 

Figure 5.47 Block diagram 
of motor 

The relationship between torque /(f) and angular position 0(f) (and hence 




their transforms L(s) and 0(5)) illustrates one of the pitfalls of constructing 
a block diagram from its components. Although the block diagram for an 
inertia load J is straightforward and was given earlier as Figure 5.34, it is 

L fr) [ 


®6) < 


J ? 



necessary here to reverse the flow and invert the transfer function in order 
to obtain 0(s) in terms of L(s) (Figure 5.48). 


Figure 5.48 Rearranged block 
diagram of inertia 
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The relationship between angular position 0(t) and the position-related voltage v p (t) 
is a straightforward linear one involving the sensor constant K s and, for brevity, we will 
proceed straight to the block diagram (Figure 5.49). 

The complete block diagram for this system is shown in Figure 5.50. 



Figure 5.50 Block diagram for (over-simplistic) position control system 



Figure 5.49 Block diagram 
of sensor 


Interestingly, this would not form a very useful control system as its response would be both 
undamped and oscillatory (the control system is merely equivalent to an undamped rotational 
spring acting upon the inertia load). This further illustrates the need for an effective control 
algorithm, and will be explored further in Sections 5.11 and 5.13. 



representing measures speed 

actual speed as a vo ltage o) 


Figure 5.51 Radar antenna on viscous bearing, with tachometer control 

A more complex example is as follows. A radar antenna (or aerial) may be represented as a 
viscous load with torque— speed constant b, i.e. in order to drive it at constant angular velocity 
«, a constant torque l is required: 


l = bw (5.102) 

It also has a moment of inertia J. It is driven by an electric motor with motor constant K n] and 
armature resistance R, at a speed that must be closely controlled. The desired angular velocity 
is co D (which is shown for illustrative purposes in Figure 5.51 as being manually entered via an 
adjuster which creates a variable voltage proportional to « D ), and the actual angular velocity 
« is measured via a tachogenerator with a constant of IC t Vs/ rad. The system is controlled via 
negative feedback with a controller with transfer function G c (s) and via a voltage amplifier with 
gain K :i . 

Consider firstly the antenna itself. In order to drive it at a given value of angular velocity w, a 
constant torque l is required, as expressed in equation (5.102), so it is straightforward to treat 
the relationship between antenna torque and antenna speed in the time domain as 


m = mo 

(5.103) 

After taking Laplace transforms this results in 

L(s) = bQ,{s) 

(5.104) 
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which leads to the block diagram shown in Figure 5.52. 

However, if the torque l m available from the motor for driving it exceeds the torque l 
required to overcome viscous friction and maintain a steady speed, it will accelerate at the 
rate (l m — l)/J, so that the equation of motion of the antenna is 


Figure 5.52 Block diagram 


dw(f) 

jm = = Zm(t) _ 

( 5 . 105 ) 

This can be represented in the s domain as 


Jsil(s) = L^is) ~ L(s) 

( 5 . 106 ) 

which can be rearranged to give 


a(s) =j s (L m (s) - m 

( 5 . 107 ) 


The corresponding block diagram is therefore as shown in Figure 5.53. 
Inserting equation (5.104) into equation (5.107) gives 

n(s) =j s (L m {s) - bil(s)) ( 5 . 108 ) 

The corresponding block diagram (Figure 5.54) for the antenna, treated 
as a combined frictional and inertia load, can be created either directly 
from equation (5.108) or by joining together the relevant flows from 
Figure 5.52 and Figure 5.53. 



Figure 5.53 Block diagram of inertia load with 
external drag L(s) 


L(s) 


Figure 5.54 Block diagram of antenna treated as 
combined frictional and inertia load 

However, the antenna is driven from a motor that will provide a torque l m that depends o 
speed 0) at which it is rotating: 

L(t) = v(t) - (O(f) ( 

or in the s domain: 

LJs) = V(s) ^ - n(s) ^ ( 

This is shown in block diagram form in Figure 5.55. 


1 


< 

R 

1 

, 


Figure 5.55 Block diagram for motor 
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The block diagram for the antenna-motor 
assembly is therefore as shown in Figure 5.56. 

But recall that this system is driven via a voltage 
amplifier from the output of a controller, which 
acts upon the error signal. This error signal is in 
turn the difference between the required speed 
n D (s) (converted into a voltage) and the voltage 
signal FC ( H(s) from the tachogenerator. This 
control system driving the antenna and its motor 
may be represented as shown in Figure 5.57. 





Motor 

Figure 5.56 Block diagram for antenna-motor assembly 



Tacho-generator 

Figure 5.57 Block diagram for control system 


Inserting the block diagram for the antenna and motor results in the block diagram for the 
whole system (Figure 5.58). The behaviour of this system will be explored in Sections 5.12 
and 5.13, though it is first necessary to convert the rather complex block diagram to a single 
transfer function. 



Tacho-generator 

Figure 5.58 Block diagram for complete antenna system 


Learning summary 

By the end of this section you should have learnt: 

/ to construct a block diagram for systems or subsystems based on the transfer functions of each 
of its components or features; 

/ to assemble the block diagram for a complex system from the diagrams for its various 
subsystems; 

/ to appreciate that distinguishing the input and output of particular processes is not always 
straightforward. 
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5.9 Conversion of the block diagram to the 
transfer function of the system 

This process is broadly based upon two rules: 

(1) the multiplication of the transfer functions of successive blocks in a sequence; 

(2) the simplification of the block diagram shown in Figure 5.59(a) into a single transfer 
function shown in Figure 5.59(b), as demonstrated in equations (5.45) to (5.47). 



Figure 5.59 Block diagram of a typical control loop: (a) before and (b) 
after reduction to single transfer function 


Essentially, this pair of actions is applied repeatedly to the feedback loops within a complex 
control system, starting with the innermost loop and working outwards. 

Taking first the example of the block diagram for the controlled heat transfer system (Figure 
5.21), the innermost loop can he tackled by replacing G(s) with the transfer function 1 /mC p s 
of the main flow, and H(s) with the transfer function Ah of the feedback flow. Substituting into 
equation (5.47) gives: 

1 

G(s) tnC p s 1 

1 + G(s)H(s) ~ ~ " lC v s + Ah (5 ' 111] 

mCpS 

giving the same result as the more lengthy manipulation within equation (5.31). 

The overall block diagram then simplifies to Figure 5.60(a) and hence to Figure 5.60(b). 



Figure 5.60 Simplified versions of block diagram for heating system 


Now, the process of simplifying the structure of the block diagram is repeated. G(s) is replaced 
with K/(Ah+mCpS), and H(s) with 1, since there is no transfer function associated with the 
feedback flow, which simply feeds the output directly back to the differencing junction. This 
manipulation results in the transfer function for the system: 

T(s) G(s) KJ (mCpS + Ah) K 

T d (j) ~ 1 + G{s)H(s) ~ 1 + K/(mC p s + Ah) ~ mC p s + Ah + K 
The error (i.e. the difference between desired and actual values of output) is: 

TM - m = TM - + K - rcfr) mC " C ^„t K 

This expression will be used further in examining steady-state error in section 5.12. 

Dealing now with the radar antenna example, the innermost loop in Figure 5.58 can he 
considered by replacing G(s) with the transfer function of the main flow 1 /(Js) and H(s) with 


(5.112) 

(5.113) 
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the transfer function of the feedback flow b. The identity given in 
equation (5.47) is then used to enable the innermost loop to be replaced 
with a single block containing the transfer function: 


l/lft) _ 1 

1 + b/(Js) Js + b 


(5.114) 


Kl 

R 


The resulting block diagram for the antenna and motor is now as 
shown in Figure 5.61. 

Applying the same principle to the next loop involves replacing G(s) 
with 1 /(Js+b) and H(s) with K m 2 /R, again giving a single block 
with the transfer function 


K m 

mr 



~R 



js + b 


Figure 5.61 Block diagram for antenna and motor 
after simplification of inner loop 


!/(,£ + b) 1 _ R 

1 + K m 2 /[R(Js + b)] Js + b + Kj/R RJs + Rb + K m 2 


(5.115) 


giving a further simplified block diagram for the combination of antenna and motor (Figure 
5.62). 



K m 


R 



~R~ 


RJs + Rb + Kj 



Figure 5.62 Block diagram for antenna and motor after simplification of outer loop 

The resulting block diagram for the whole system is now as shown in Figure 5.63. 



Figure 5.63 Block diagram for complete antenna system before combining blocks 


The transfer functions of the four innermost blocks may be multiplied out to give the 
following much simpler block diagram for the system (Figure 5.64). 



Figure 5.64 Simplified block diagram for complete antenna system 


Substituting yet again for G(s) and H(s) within equation (5.47) for the remaining loop allows us 
to replace that loop with the following transfer function (Figure 5.65): 


G c (s)K 3 K m I K t G c (s)K 3 K m G c (s)K 3 K m 

RJs + Rb 1 K m 2 1 1 + RJs + Rb + Kj ~ RJs + Rb + Kj + K t G c (s)K 3 K m 


(5.116) 


Op(0 

K t 


G c (s)K 3 K m 

0(0, 



RJs + Rb + K m 2 + KJJJsjKR,,, 



Figure 5.65 Elimination of loop for complete antenna system 
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Finally, multiplying by the transfer function of the leftmost block results in the transfer function 
for the whole system: 

Qfr) K t G c (s)K,K m 

a D (s) RJs + Rb+ K m 2 + K t G c (s)K,K m 
enabling the block diagram to be reduced to a single box (Figure 5.66) 


fl D (s) ^ 

K t G c ( S )K-^ m 



RJs + Rb + K,„ 2 + K t G c (s)K 1 K m 



Figure 5.66 Final block diagram for complete antenna system 


An expression can therefore be written for the error, which is the difference between the 
desired value of angular velocity and the achieved value: 

/ 1 - K t G c (s)K a K m \ 

~ °( 5 ) - \ R j s + Rb + K J + K t G c (s)K 3 K m ) nD(5) 


RJs + Rb + K m 2 + K t G c (s)K. tl K m - K t G c (s)K a K m 
RJs + Rb + K m 2 + K t G c (s)K a K m 


n D (^) 


(5.118) 


RJs + Rb + K m 2 

~ RJs + Rb+ K m 2 + K t G c (s)K,K m 

This expression will be used in Section 5.12 when examining the steady-state response of the 
system. 


Learning summary 

By the end of this section you should have learnt: 

/ the relationship between the transfer function of a closed-loop system and the transfer functions of 
its main and feedback paths; 

/ the procedure for simplifying a block diagram with one or more loops in order to reduce it to a 
block diagram involving a single transfer function. 


5.10 Handling block diagrams with 
overlapping control loops 


A problem appears to arise when dealing with a block diagram including loops that are not 
simply nested one inside the other, but overlap in the manner shown in Figure 5.67. 



Figure 5.67 Block diagram with overlapping control loops 
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This is handled via a simple trick. The point at which one of the loops branches away from the 
main flow of the diagram is moved, and appropriate blocks are introduced into the feedback 
loop to compensate for the change in the topology of the diagram. For example, if a branch 
leading to a feedback loop is moved ‘upstream’ from the original branching point, the blocks 
that now lie outside the loop are now replicated within the loop itself so that the open-loop 
transfer function for that loop stays the same. In the example given above, the point at which 
the upper loop leaves the main flow is moved to coincide with the point at which the lower 
loop leaves. However, this would involve omitting the block containing G 4 (s) from that control 
path. That block is therefore reintroduced into the loop so that the overall transfer function for 
that loop stays the same (Figure 5.68). 



Figure 5.68 Block diagram rearranged to avoid overlapping of loops 


It is now straightforward to reduce this block diagram to a single transfer function in the 
usual way. Replacing G(s) in equation (5.47) with G 3 (s) and H(s) with H 2 (s)H 4 (s), the inner 
loop involving G 3 , H 2 and G 4 can be expressed as a single transfer function, so that the block 
diagram can be drawn as shown in Figure 5.69. 



Figure 5.69 Block diagram redrawn to eliminate inner loop 


By a similar process the remaining loop is eliminated (Figure 5.70). 



Figure 5.70 Block diagram redrawn to eliminate outer loop 


The block diagram is now in the form of a straightforward sequence of blocks whose transfer 
functions can be multiplied together to give the transfer function of the overall system (Figure 
5.71). 


GMGJAG&GM 

m 

1 + G 3 (s)H 2 (s)G 4 (s) + G 2 (s)G 3 (s)Hi(s) 



Figure 5.71 Block diagram redrawn in the form of 
a single transfer function 
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5.11 The control algorithm and proportional- 
integral-derivative (PID) control 

So far, only the simplest assumptions have been made regarding how the error (which may 
be colloquially expressed as ‘the difference between what you want and what you get’) is 
processed. The rule by which this error is processed in order to obtain the required corrective 
action is known as the control algorithm and is central to the behaviour of feedback and control 
systems. Ideally, the control system should: 

• take more corrective action if the error is larger; 

• take more corrective action if the error has persisted for a long time; 

• take more corrective action if the error is getting worse (and less if it is getting better). 

A very widely used control algorithm is the three-term or PID (proportional-integral- 
derivative) controller, which formalizes the three kinds of decisions defined verbally above. 
Specifically, it sets the corrective action to a value that depends on the sum of 

• a multiple of the current value of the error; 

• a multiple of the integral of the error, i.e. the value of the error integrated or accumulated 
over the time since the beginning of the process; 

• a multiple of the derivative of the error i.e. its rate of change with time. 

Mathematically the control algorithm is defined as: 

G c (s) - K c (l + T n s + 

K c 

= K c + K c Tns + Tjr- (5.119) 

IjS 

where: 

K c is the gain of the controller 

T d is the derivative time constant (a larger time constant leads to more derivative action) 

Tj is the integral time constant (a larger time constant leads to less integral action). 

This is sometimes written alternatively as 

K, 

G c (s) =K C + K d s + — (5.120) 


In general, the terms within PID control have the following effects: 

• proportional control alone tends to result in steady-state error, which means that the 
system output never reaches the desired value, typically tending asymptotically to it; 

• integral action in conjunction with proportional control tends to eliminate steady-state 
error, at the expense of tending to make the system s response oscillatory or even unstable; 

• derivative action can in some circumstances damp down the tendency towards oscillatory 
behaviour without reintroducing steady-state error. However, there can be pitfalls in 
differentiating real signals as this procss can tend to accentuate noise. It can be preferable to 
measure a derivative directly, for example as an angular velocity rather than obtaining it by 
numerical differentiation. 


As an example, consider the positioning system described earlier. It is straightforward 
from Figure 5.50, that the open-loop transfer of the position controller system is 
F P (i) K a K m K s 
Js 2 




(5.121) 


For illustrative purposes, the values K a = 10, K m = 1A/V and FQ = 1 Vs/rad will be 
assumed. It can be demonstrated, either analytically by reversing the application of Laplace 
transforms to find the time domain solution, or by numerical simulation, for instance, using 
the Simulink simulation package (www.mathworks.com/products/simulink), that the response 
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of this system to a demand of 10 V (representing a request to move by 10 radians) results in 
an oscillatory response which continues forever at constant amplitude (Figure 5.72), which 
is clearly not a useful response. And we have not yet considered what happens when there is 
some external torque trying to move the load away from its desired position. 



Now let us replace the simple servo-amplifier driving the motor with a controller/amplifier 
of gain K c and integral and derivative time constants T : and T D , and additionally introduce a 
torque load L t (s) which is applied externally to the shaft carrying the flywheel. The resulting 
block diagram is shown in Figure 5.73. 



Figure 5.73 Block diagram of positioning system with PID control and an external load 


The open loop transfer function can no longer be expressed as a simple ration of output to 
input, but for completeness the following expression can be derived: 


Vv (s) = VjM 


K c K m KJ n [ s i + % s+ T^J + sL d4 


(5.122) 


Along with the values of system constants assumed earlier, the values T D = 0.5 second and 
T| = 1 second will be assumed for illustrative purposes, and a demand of 10 V and an external 
load of 20 Nm will be imposed at t = 0. Proportional control only (omitting both integral 
and derivative term) again results in a sinusoidal response of constant amplitude similar to that 
shown in Figure 5.72, this time oscillating about a mean value of 12 radians. Proportional 
plus integral control only (i.e. omitting the derivative term) results in an exponentially 
growing sinusoidal response (Figure 5.74). Proportional plus derivative control gives a well- 
damped response rapidly settling to a steady value of 12 radians, indicating a steady-state error 
of 2 radians. None of these responses is really satisfactory; however, using all three terms 
(proportional, integral and derivative control) results in a response that overshoots a tittle but 
rapidly settles to the desired value of 10 radians without any steady-state error. 
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Desired position 

Proportional plus integral action 

Proportional plus derviative action 

Proportional, integral and derviative action 


Figure 5.74 Response of positioning system with proportional plus integral control alone (blue line and 
inset), proportional plus derivative control (cyan line) and PID control (green line) 


The application of different control algorithms to the radar antenna problem is considered in 
a broadly similar manner in the next section with reference to steady-state error and dynamic 
response, and the response of both of these systems is revisited yet again in Section 5.13 on the 
root locus method. 

Learning summary 

By the end of this section you should have learnt: 

/ to assemble a collection of transfer function boxes relating to separate aspects of a control system 
to form a complete block diagram for that system; 

/ to rearrange and simplify that system to find the transfer function of the system; 

/ the importance of the control algorithm; 

/ the purpose of the three terms in a PID controller; 

/ typical effects of the three contributions to PID control. 


5.12 Response and stability of control 
systems 

In order to understand the actual behaviour of a given control system, the chosen control 
algorithm must be substituted for G c (s). Once this has been performed, a number of issues can 
be examined: 

• Is the system stable? 

• Is its response oscillatory, i.e. is it overdamped or underdamped? 

• Is there a steady-state error, i.e. does the actual output from the system converge to the 
desired outcome, or is there a difference that persists indefinitely? 

These issues will be examined by continuing to examine the heater and radar antenna problems. 

Steady-state error 

Steady-state error is the difference between demand and achieved output which persists as time 
becomes infinite, and is typically evaluated for one of two situations: 

• a step input at t = 0, corresponding to a demand issued at t = 0 for the system to attain a 
steady value; 
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• a ramp input beginning at t = 0, corresponding to a demand for the output to increase 
linearly from zero with time, i.e. for a constant rate of motion or rate of change of output. 

As a link with earlier work it will be recalled that the error for the heated system under closed- 
loop control was given in equation (5.113). If a step input is assumed, consisting of a demand 
for a temperature iq imposed at t = 0, the demand can be expressed in the s domain (by using 
line 1 of Table 5.1). 


T d (5) = j,5£{H(t)} = h 


( 5 . 123 ) 


Inserting this into equation (5.113) and applying the final value theorem (equation (5.59)) 
gives the error as time tends to infinity: 

T , mCpS + Ah Ah 

Steady-state error = lims — — . — — — t, ( 5 . 124 ) 

y 5— »o ^ mC p s + Ah + K Ah + K 1 ' ' 

which recovers an expression for error equivalent to equation (5.8) which was derived via the 
simple, time -independent analysis, confirming that this simple proportional control algorithm 
does not achieve a degree of control which eliminates steady-state error. 

If, by contrast, a linearly increasing demand of t 2 °C/s is imposed, beginning at t = 0, the 
demand is modelled in the 5 space by making use of line 2 of Table 5.1: 

T d (5) = ^£{tH(t)} = J ( 5 . 125 ) 


Attempting to calculate the steady- 

Steady-state error = lim 


error gives 

t 2 mCpS + Ah 
7 mC-s + Ah + K ~ 


from which we can conclude that this simple control system will not only fail to achieve the 
desired (constantly increasing) temperature, but indeed will fail even to provide a steady offset 
from the desired temperature and the error will become infinitely large with time. 

Considering now the antenna problem, if proportional action K c alone is assumed within 
equation (5.119) so that G c (s) = K c , T D = 0 and T : = oo, and that expression for G c (s) is 
further substituted into equation (5.118) which gives the error in the antenna angular velocity, 
the following expression is obtained for the error as a function of s: 

„ , „ I RJs + Rb + K m 2 \ 

fi D (*) ^0) - + Rb + Kj + K X K C K^KJ ( 5 . 126 ) 


Applying the final-value theorem enables the steady-state error to be calculated for the 
situation where the demand Tl r> (s) takes the form of a Heaviside step function at t = 0 and of 
magnitude Hp 

Hi I RJs + Rb + K m 2 \ 

hm(ft D (5) - m = li ™ — [ RJs + Rb + Km 2 + K t K c K 3 K m J (5 - 127) 

I Rb + K 2 m \ 

\Rb + K 2 m + K t K c K,K m r h 

which is a non-zero value. 

The effect of proportional action alone is illustrated in Figure 5.75. For illustrative purposes, the 
value of R will he taken as 1 IT, / as 1 kgm 2 , b as 1 Ns/rad, K m and K t both as 1 Vs/rad, K :i as 1 
and K c as 10. It is seen that the response to a demand beginning at t = 0 for a speed of 
II | = 10 rad/s is an actual response tending asymptotically to 8.333 rad/s — a steady-state error 
of 1.667 rad/s correctly predicted by equation (5.127). 
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Desired angular velocity 

Proportional action only 

Proportional plus integral 




Figure 5.75 Response of radar antenna under proportional control alone and under proportional 
plus integral action 


By contrast, substituting the following expression (corresponding to proportional plus integral 
action, but no derivative action) into equation (5.118): 

G c (s) = K c (l + (5.128) 

results in the following expression for error: 

RJs + Rb + K m 2 

ll D (s) - i 1(5) - 11 d (5) R j s + Rb + K j + KtKc[1 + \/(T { s)]K z K m 


I T lS {RJs + Rb + Kj) \ 

0e >( S \T t s(Rfs + Rb + K m 2 ) + K t K c ( 1 + T^jK^Kj 

I T t RJs 2 + T,(Rb + Kj)s ' 

\ T t RJs 2 + T l (K t K c K z K m + Rb + K m 2 )s + K t K c K i K mt 


(5.129) 


Therefore, substituting for fl d (.?) as before and applying the final-value theorem gives the 
following expression for steady-state error: 

lirn(fl D (5) — f \(s)) 

nj_/ TjRJs 2 + 7)(Rfe + K m 2 )s 

"" \ l\RJs 2 + T 1 (K t K c K ll K m + Rb + K m 2 )s + K t K c K 3 K m 

= 0 

In other words, inclusion of the integral term has eliminated steady-state error. This is also 
shown in Figure 5.75 for an integral time constant 1] of 2.8 seconds, chosen to achieve a rapid 
settling time without oscillation (corresponding to critical damping) in response to a step 
input. A more detailed consideration of the form of the response (oscillatory or otherwise), and 
the effect of including derivative action, are included in the discussion on dynamic response of 
second-order systems contained in the next subsection. 


Dynamic response 

So far, we have concentrated on what happens immediately after our system experiences its 
input and in the (infinitely) long term. No attempt has yet been made to quantify how the 
control system behaves in the short and medium term in response to an input. We are likely to 
be particularly interested in the following questions (refer to Figure 5.76 for typical responses): 
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• Does the system respond monotonically to an input or does its value overshoot and oscillate 
— in other words, is its response overdamped, underdamped or critically damped? 

• If the response is underdamped, at what frequency does the output oscillate? 

• Does the system settle down to a steady value or does the output grow increasing large 
(possibly in an oscillatory manner) with time? In other words, is its response stable? 

These issues are covered in significant depth in Unit 6, and an understanding of these types of 
damping will be assumed here. 

In practice, it is found that many systems typically exhibit behaviour analogous to a second- 
order (mass— spring— damper) system, and it is instructive to answer the above questions by 
relating the system’s response to that of a second-order system. 



Underdamped (£ = 0.2) 
Critically damped (£ = 1) 
Overdamped (C, = 2) 


Figure 5.76 Graphs of underdamped, critically damped and overdamped responses of a 
second-order system 


The key to answering the above questions lies in the closed-loop transfer function, and in 
particular the characteristic equation which is obtained by setting the denominator of the 
transfer function (the characteristic polynomial) to zero. However, before examining a 
true control system it is instructive to examine the simple mass— spring— damper system (Figure 
5.77(a)) against which we will compare a control systems behaviour. 


© 1- 
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Figure 5.77 (a) Mass-spring-damper system; (b) free-body diagram of 
mass 


Consider the mass to be subjected to an externally applied force/(f). From the free-body 
diagram of the mass under displacement x(t) (Figure 5.77(b)) and noting that Newton’s second 
law states that net force is equal to mass X acceleration, the equation of motion of the mass can 
be written down as 


/(f) - kx(t) 


dx(t) 
C df 


d 2 *(Q 

m ^r 


(5.131) 
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This can be rearranged a: 


d 2 x(t) d x(t) 

M = m + kx(t) 


d t 2 d t 

Taking Laplace transforms, and assuming zero initial conditions, gives 
F{s) = s 2 mX(s) + scX(s) + kX{s) 

This can be rearranged to give the transfer function of the system: 

X(i) _ l 

F(s) ms 2 + cs + k 

However, it is instructive to rearrange it again into as 
X(s) _ 1/m 

F(s)~ s 2 + ± s+ k 


(5.132) 

(5.133) 

(5.134) 


(5.135) 


The characteristic equation 


be obtained by setting the denominator equal to zero: 

52 + m + m = ° (5.136) 


However, it is well known that, for a simple mass— spring damper system, the following 
quantities can be defined: 

• the natural frequency oo n where 

oo n = ^ (5.137) 

• the damping ratio t, (pronounced ‘zeta’) where 



The value of t, gives an indication of the response of the system, giving a more precise way to 
characterize the behaviours shown in Figure 5.76: 

• If ^ = 0, the system is undamped and will continue to oscillate indefinitely if it is perturbed. 

• If 0 < t, < 1, the system is said to be underdamped and will exhibit overshoot and a 
decaying oscillatory response when perturbed. 

• If ^ = 1, the system is said to be critically damped, and will exhibit a response that tends 
towards its steady-state response rapidly hut without oscillation (or with only a small amount 
of overshoot). 

• If £ > 1, the system is said to be overdamped and will tend asymptotically to 
its steady-state value, and may take a long time to reach a value that can he 
regarded as close enough to the value desired. 

An everyday example of a second-order system that could fall into any of these 
categories would be a door that can open in either direction (such as a door between 
sections of a factory, or between a restaurant and a kitchen) and spring-loaded to 
return it to its closed position while allowing people or vehicles to push it open 
in either direction without operating a latch (Figure 5.78). 

If the sprung door is underdamped, it will swing backwards and forwards after 
someone has passed through it; if it is overdamped, it will take a long time to close, 
and if it is critically damped it will close rapidly without swinging backwards and 
forwards. Common sense indicates that in this case, as in many practical situations, a 
critically damped response gives the best compromise between obtaining a quick 
response and avoiding prolonged oscillations, and this reasoning is generally 
applicable also to control systems. 



of a second-order system 
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Note that the natural frequency is not necessarily quite the same as the frequency at which 
oscillations will occur if the system is perturbed and allowed to undergo free vibration - it 
is only equal to the free oscillation frequency for the undamped case (£ = 0). If the system 
is underdamped, the actual frequency will be w n V(l — £ 2 ) while if it is critically damped or 
overdamped, no oscillation will occur at all. 

Returning to the analysis of the second-order (mass— spring— damper) system, the characteristic 
equation can now be expressed in terms of the natural frequency and damping ratio as 

5 2 + 2 £ov + w„ 2 = 0 (5.139) 

Second-order characteristic equations are frequently encountered in control systems, and it is 
often useful to determine by inspection of the characteristic equation what its response will be. 

As an example, consider again the motor-driven, viscous-damped antenna described earlier. If 
proportional plus integral action is used, the closed-loop transfer function becomes 

fl(s) K t K c(l + l/T l s))K ll K m 

n D (s) ~ RJs + Rb + K m 2 + K t K c ( 1 + ]/(T l s))K 1 K m 


K t K c {T t s + 1 ) K :1 K m 

'TiRJs 2 + RbT lS + K m 2 T t s + K t K c (T ]S + 1)K,K rn 


(5.140) 


K t K c K,K m (T [S + 1) 

T,R/r + ( Rb + K m 2 + K t K c K a K m )T I s + K t K c KJ< m 


The characteristic function (the denominator) is clearly that of a second-order system. The 
transfer function can then be rearranged to get the characteristic function (the denominator) 
into the form in which it appears in equation (5.135): 


fl(i) K l K c KJ< m {T\S + 1 )/{T l RJ) 

n D (s) ~ s 2 + [(Rb + K m 2 + K t K c K, A K m )/(RJ)]s + K t K c K z KJ (TJlj) 


(5.141) 


The characteristic equation can therefore he written down by setting the denominator to zero: 
s 2 + [(Rb + K m 2 + K t K c K, d K m )/(RJ)]s + K.K^KJiTJlj) = 0 (5.142) 


From inspection of this characteristic equation, and by comparison with equation (5.139), it is 
seen that 


and 


w n 


! K t K c KJ< m 

rjij 


Rb + K m 2 + K t K c KJ<^ 
2 RJ(o n 


Rb + K m 2 + K t K c K.J< m I Tj 

2 V R J K t K c K zKn 


(5.143) 

(5.144) 


It can be seen therefore that introducing integral action has the effect of turning the system 
from a first-order to a second-order one. Also, increasing the amount of integral action 
(specifically, reducing Tj) has the effect of reducing the damping ratio and making the system 
more oscillatory. For example, with the parameters used earlier but with % = 1 , the system 
becomes overdamped (£ = 1.9) and approaches the demand speed slowly (Figure 5.79), 
whereas with T : = 0.01, the system is noticeably underdamped (£ = 0.19) and the speed will 
fluctuate before settling down to its desired value. 


If proportional-integral-derivative (PID) action is used, the corresponding analysis becomes 
rather messy hut eventually results in the following expressions for natural frequency and 
damping ratio: 


« n 


I T t RJ + Tj T D K t K c K 3 K m 


(5.145) 
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and 

Tib + K; n + K t K c KJC m Tib + K° m + K t K c KJC m j T t 

^ ~~ 2 {RJ + T u K t K c K,K m ) co n “ 2 ]/ ( RJ + r D K t K c K l K m )K t K c K a JC m 

(5.146) 


In fact, as this is a first-order system in open loop, it can be seen that derivative action actually 
has no benefit in terms of increasing damping and can actually make stability worse by 
reducing the value of this is not the case for systems with a second-order open-loop transfer 
function, such as the positioning system analyzed earlier. 



Frequency response for harmonic excitation 

This will only be covered briefly, though it forms a more significant part of texts on control 
theory. The response of a control system to a sinusoidally varying (harmonic) input can be 
determined by inserting the complex quantity jw into the system’s transfer function G(s) 
where j is the symbol used by engineers for the imaginary unit (the square root of — 1, often 
termed i by mathematicians). The result is a complex quantity whose real and imaginary parts 
Re(G(/o))) and Im(G(/’w)) will generally vary with the value of circular frequency «. The result 
can either be left directly in complex form, or can be converted into magnitude and phase (or 


modulus and argument) as follows: 

|G(/(o)J = V(Re(G(»)) 2 + (Im(GO))) 2 

(5.147) 

and 


_ _ 1 /Im(G0’0)))l 

arg(G0 ”)) Ue(C(M)J 

(5.148) 


Note that the argument must be placed in the correct quadrant. 

This calculation process may be performed for both the open- and closed-loop situations. The 
results may be plotted in two different forms: 


Bode plot 

This is actually a pair of graphs, presenting modulus and argument of G(j«) plotted against jo). 
Logarithmic axes are used for both variables of the modulus plot and for the frequency axis of 
the argument plot. This is useful in order to identify resonances. Figure 5.80 gives a Bode plot 
for a second-order system, showing its single resonant frequency. 
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Circular frequency (rad/ s) 
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Figure 5.80 Example of Bode plot for a second-order system 



Nyquist plot 

This is a single graph plotted on an Argand diagram (i.e. a representation of the complex 
plane) that presents the locus of the points (Re(GQ'co)), Im(G(jw))) as co is varied from zero to 
infinity. The result is typically a spiral-like plot that approaches the origin as w tends to infinity. 
Although it contains less data than the Bode plot (the dependency upon frequency is lost, 
unless specific points are marked with values of w), it is particularly helpful in understanding 
stability issues. Figure 5.81 shows plots for a typical second-order system and a higher-order 
system. 




o o 

Figure 5.81 Nyquist plot for (a) typical second-order system; (b) typical higher-order system 
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Stability 

Another very important issue is the stability of a control system. It has already been shown that 
a system may respond in an underdamped, critically damped or overdamped manner. However, 
it is also very important to determine whether or not a system exhibits stability, i.e. whether, 
for a bounded input (such as a unit impulse or step function), the output is also bounded, 
rather than (for example) undergoing oscillations which grow in amplitude with time rather 
than decaying. Such behaviour can occur owing to the use of excessive gain in a controller, or 
to time delays or phase lags in the controller or elsewhere in the loop. 

A good example of unstable behaviour of a feedback system is the unpleasant ‘howling’ that 
occurs with a public address (PA) system at a rock concert or in a lecture theatre, when the 
microphone is placed too close to the loudspeaker, or if the volume control (gain) on the 
amplifier is turned up too far. The result is an oscillatory response that builds up until the 
maximum amplitude of the amplifier is reached. 

There are three approaches to determining the stability of a control system, two of which will 
be considered here (the root locus method is dealt with separately in section 5.13). 

Routh-Hurwitz criterion 

This is perhaps the easier criterion to apply theoretically, though its physical meaning is 
not obvious. The closed-loop transfer function is determined as shown in Section 5.9, and 
the characteristic polynomial is found. It is then necessary to discover whether or not the 
characteristic function (obtained by equating the characteristic polynomial to zero) has any 
roots with positive real parts, which would result in a response that would become larger 
(rather than smaller) with time, and the Routh— Hurwitz criterion is a rapid way to find this 
information without going to the trouble of actually finding the roots. The coefficients of the 
characteristic polynomial are used as the basis of figures that are inserted into a table known as 
the Routh array. Let the characteristic function take the form: 

a„s n + a n -\S n ~ 1 + ... + a 2 s 2 + dj-s + % (5.149) 

The following table is completed: 
s n 

s n-l 
s n-2 

S n ~ 3 
s n-4 


where 




or more generally: 



Diagrammatically, the terms are obtained as shown in Figure 5.82. 
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This entry ... Is the product of ... minus the product ... all divided by 


these ... of these ... this entry here 

Figure 5.82 Sequence of calculating a new term in the Routh array 

The number of changes of sign in the left-hand column is equal to the number of roots of the 
characteristic polynomial having positive real parts. In the context of control system stability, if any of 
the roots have positive real parts, the system will be unstable. We can therefore conclude that if there 
are any changes in sign of the left-hand column of the Routh array, the system will be unstable. 

This can be illustrated via the following example. 


Worked example 

Find how many positive roots the following characteristic equation has: 

# - 45 5 - 7s 4 + 44s 3 - 214s 2 + 140s + 400 = 0 (5.150) 

The Routh array is as follows: 


1 

-7 

-214 

400 

-4 

44 

140 

0 

(-7) X (-4) - 1 X 44 . 

(-214) X (-4) - 1 X 140 

400 X (-4) - 1 X 0 

0 

(-4) “ 4 

(-4) - 179 

(-4) - 400 

44 X 4 - (-4) X (-179) 

140 X 4 - (-4) X 400 

0 

0 

4 

4 = 540 

(-179) X (-135) - 4 X 540 

400 X (-135) - 4 X 0 

0 

0 

(-135) “ 163 

(-135) - 400 

540 X 163 - (-135) X 400 

0 

0 

0 

(-163) “ 208 ' 7 


There are four changes of sign in the left-hand column indicating four roots with positive real parts. In fact, this 
example of the characteristic function was generated by expanding the left-hand side of the following equation: 


(s + l)(s - 2)(s + 4)(s - 5)(s - 1 + 3 ;)(s - 1 - 3j) = 0 (5.151) 

so the roots are — 1, 2, —4, 5, 1 — 3 j and 1 + 3 j, with the four roots 2, 5, 1 — 3 j and 1 + 3j all having positive 
real parts. A system with this characteristic equation would clearly be extremely unstable! 


Nyquist stability criterion 

This approach is more intuitive though it involves more numerical calculation. Moreover it 
can be applied experimentally without actually causing the control system to go unstable. The 
Nyquist plot for the open-loop (not closed-loop) transfer function (including the contribution 
from the feedback loop, i.e. G(s)H(s) within Figure 5.27), is plotted as shown in Figure 5.83. If 
the curve encloses the point in the complex plane corresponding to the real number —1 (i.e. the 
point one unit to the left of the origin, on the negative real axis) as shown in Figure 5.83(b), the 
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system will be unstable. The closer the curve comes to enclosing this point, the less stable the 
system will be (meaning in practice that disturbances will die away more slowly). 




Real part Real part 

o o 

Figure 5.83 Nyquist plots of (a) the open-loop transfer function of a stable 
high-order system, showing the phase margin; (b) the open-loop transfer 
function of an unstable system showing how the plot encloses the point -1 

The physical interpretation to this is as follows. If the feedback signal to the differencing 
junction is of opposite sign to the demand signal (or, more accurately, contains a component 
which is of opposite sign), the result from the differencing junction is actually an amplification 
of the input signal, and the net effect is equivalent to positive rather than negative feedback. 

If the amplitude of this ‘positive feedback’ signal is less than the input, the effect is not 
catastrophic as the feedback signal is still attenuated with respect to the input and each pass 
through the system attenuates it further. However, if the opposite-sign (180° out-of-phase) 
component of the feedback signal equals or exceeds the amplitude of the input signal, the 
feedback signal will be further amplified by the system, and a larger signal will be fed back. This 
implies that the signal will continue to grow and that instability will result. 

The practical applicability of the Nyquist stability criterion is that the open-loop Nyquist plot 
can be constructed for a real (but theoretically uncharacterized) piece of control equipment, based 
on experimental response (magnitude and phase) to a sinusoidal input. The Nyquist stability 
criterion provides a very useful tool for predicting the closed-loop behaviour of a piece of 
apparatus without taking any risk of allowing it to become unstable. The amount by which the 
feedback signal would have to be scaled to enclose — 1 on the plot gives a measure of the amount 
by which the gain of the controller could be increased without instability occurring. This is 
known as the gain margin and is usually expressed in dB (decibels, defined as 20 log 10 of the factor 
by which the gain must be increased to cause instability) rather than as the factor itself. 

Another useful quantity is the phase margin. This is the angle between the negative real axis of 
the Nyquist plot and the point at which the curve intersects the unit circle (Figure 5.83(a)). It 
represents the amount of additional phase lag needed in the control system to cause instability. 

Learning summary n 

By the end of this section you should have learnt: 

/ the issues of response, damping and stability; 

/ the meaning of the terms ‘characteristic polynomial’ and ‘characteristic equation’; 

/ to determine the natural frequency and damping of a second-order system from the characteristic 
equation; 

/ the meaning of a Bode plot and a Nyquist plot; 

/ the meanings of gain margin and phase margin; 

/ the principle of the Nyquist stability criterion; 

/ to apply the Routh-Hurwitz stability criterion to a given characteristic polynomial. 
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5.13 A framework for mapping the response 
of control systems: the root locus 
method 

The response of second-order systems is straightforward to analyse 
by drawing on the analogy with a mass-spring-damper system and 
inferring the natural frequency and damping ratio, but for third- 
order (and higher-order) systems, a more sophisticated method of 
conceptualizing the response is required. A useful tool is the root 
locus plot which is used for understanding the different aspects 
of a control system s behaviour, as represented by the various 
roots of the characteristic equation (which was introduced earlier 
in the present section as being the equation obtained by setting 
the denominator of the transfer function to zero). 

The root locus method involves plotting the locations of the roots 
in the complex plane (i.e. plotting their imaginary parts versus 
their real parts) as the gain of the control system is varied, and 
inferring the stability and nature of the response of the system 
from the position of the roots on the resulting diagram. Before 
looking in detail at the generation (and interpretation) of root 
locus plots, it is instructive to explore the physical meaning of 
different types of root, which can be represented in the complex 
plane, shown as (a) to (d) in Figure 5.84: 

(a) positive real roots correspond to an exponentially growing (but non-oscillatory) transient 
response to a disturbance (which is clearly an inherently unstable response since it does not 
converge to a steady value); 

(b) negative real roots correspond to an exponentially decaying, non-oscillatory response that 
dies away to zero with time and can form part of a stable response; 

(c) complex roots, which always occur in ‘conjugate pairs’ with equal positive and negative 
imaginary parts, may have positive real parts and be of the form s = a ± giving an 
oscillatory response of exponentially growing amplitude - clearly another example of an 
unstable response; 

(d) complex roots may have negative real parts and be of the form s = — a ± if} giving an 
oscillatory response of exponentially decaying amplitude — this dies away with time, and 
can again contribute to a stable response. 

A simple example of a root locus plot, relating to a non-control example of a second-order 
system, is the familiar example of a mass-spring-damper system discussed earlier in the 
present section. It can be seen from the denominator in equation (5.134) that this system has a 
characteristic function ms 2 + cs + k, which leads to the characteristic equation (5.136) given 
again here for convenience: 

s2+ m S+ m = 0 (5.152) 

For illustrative purposes let us choose mass m = 1 kg, damping constant c = 10 Ns/m and allow 
spring stiffness k to vary from zero to infinity so that the characteristic equation becomes 

5 2 + 105 + k = 0 (5.153) 

With very small k, the roots are real (for k = 0, the roots are 5 = —10 and 5 = 0), and the 
response to an initial input (such as a step or impulse) is heavily damped, decaying slowly but 
without oscillation. As the spring constant is made larger (in this case 25 N/m) the system 



Figure 5.84 Typical responses corresponding to 
positions of roots in the complex plane 
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becomes critically damped, and the roots become equal 
but still real with the value, —5. A slightly larger value of 
the spring constant causes the roots to break away from the 
real axis to form a conjugate pair of complex roots, and the 
response becomes oscillatory. Further increases to the spring 
constant make the imaginary parts of the roots larger and 
the oscillations are faster but they do not die away any more 
slowly with time. The locus of the roots as the value of k 
varies is shown in Figure 5.85. 

The same concepts can be applied to control systems, whose 
behaviour may or may not be oscillatory. In practice, it is 
found that the response of a system can vary between any or 
all of the kinds of behaviour shown in Figure 5.84 as the gain 
of the controller in a closed-loop control system is varied. 

In order to understand the relevance of the root locus 
method to control systems, recall that a typical control system 
has an open-loop transfer function G(s)H(s) as shown earlier in Figure 5.27. Furthermore, the 
effect on the transfer function of closing the loop has also been demonstrated (Figure 5.26 and 
equations (5.45) to (5.47)). Now let us assume that the transfer function of the main (forward) 
path involves a variable gain term K, arising for example from varying the gain of a controller 
lying within that forward path, and two polynomials Z G (s) and P G (s) so that 



system with varying spring constant k 


G(s) = K 


Z G (s) 

P G (s) 


( 5 . 154 ) 


Similarly the transfer function of the feedback path can be expressed in terms of two more 
polynomials Z H (s) and P H (s): 


H(s) = K~ 


so that the open-loop transfer function is 
Z G (s)Z H (s) 


G(s)H(s) = K 


P G (s)P H (s) 


Z(s) _ (s - Zl)(* - z 2 )...(s- Z„ z ) 
P(s) - K {s - Pl ){s - p 2 )...(s~ p„ p ) 


where P(s) = P G (s)P H (s), Z(s) = Z G (s)Z H (s). Also, s = z lt s = z 2 . . . s = z„ z are the roots of the 
equation Z(s) = 0 and are known as the zeros of the open-loop transfer function, while 
5 = P\,s = p 2 , ... s = p m> are the roots of the equation Z(s) = 0 and are known as the poles of 
the open-loop transfer function. 


By inserting equations (5.154) and (5.155) into equation (5.47) giving the closed-loop transfer 
function in terms of G(s) and H(s ) , the following is obtained: 


Closed loop transfer function = 


G(s) 

1 + G(s)H(s) 


P G (s) 


Z G (s)Z H (s) 

P G (s)P n (s) 


( 5 . 157 ) 


KZ g (s)P h (s) _ KZ g (s)P h (s) 

P G (s)P H (s) + KZ g (s)Z h (s) P(s)+ KZ(s) 


The closed-loop response of the system is governed by the roots of the characteristic equation, 
which once again is obtained by setting the denominator of the closed-loop transfer function 
to zero: 

P(s) + KZ(s) = (s - Pl )(s - p 2 )...(s - Pnp ) + K(s ~ z t )(s - z 2 )...(s - z„ 7 ) = 0 ( 5 . 158 ) 
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Now consider three possible situations: 

(1) When K is very small (i.e. K tends to zero), P(s) is much larger than KZ(s ) so the 
characteristic equation can be approximated as 

(s- Pl )(s-p 2 )...(s- Pnp ) = 0 (5.159) 

whose roots are simply the open-loop poles of the system. 

(2) When K is very large (i.e. K tends to infinity) and s is not large, P(s) is much smaller than 
KZ(s) so the characteristic equation can be approximated as 

(s-z 1 )(s-z 2 )...(s-z nz ) = 0 (5.160) 

whose roots are simply the open-loop zeros of the system. However, there will also 
be some very large complex roots to the characteristic equation which (for very large 
complex s) can also be approximated by expanding it out and truncating the expansions of 
the two sets of brackets after the terms involving s"p -1 and s n z _1 respectively: 

j«p-i«p-ip 1 -5»p-ip 2 ...-5»p - 'p np + K{s n z ~ fT. - ^ - s"z - l z 2 ... ~ s"z ~ 'z„ 7 ) = 0 

(5.161) 


s" p 1 (s — pi — p 2 ■ ■ ■ —p^) + Ks n z l (s — z\ — z 2 ... ~z„ 7 ) = 0 (5.162) 

Noting that s is assumed to be large, equation (5.156) can be further approximated (via 
a binomial expansion of the form (1 — a) -1 = 1 + a + a 2 ... ~ 1 + a for small a) and 
rearranged to give 



Taking the (tip — n z ) th root: 



(_ K )»p-nz ( 5 . 16 4 ) 


Noting that (1 + a) i/n = 1 + a/n + a 2 /n 2 + ... ~ 1 + a/n for small a, and re-expressing 
the (n P - n z )th roots of ( ~K) in terms of the (« P - n z )th roots of (—1): 



There are of course (n P — n y ) possible values for the (rip — rt z ) th root of (—1). Rearranging 
equation (5.165) and expressing the various (rip — n z ) th roots of (—1) in polar form: 


Pi + P2---P„ p ~ *1 + *2---*n z 
Mp - n z 


R"p-" z 


(co s ( 


(2k + 1)tt \ 
, n P ~ n z } 


:^)i 


(5.166) 

where k = 0 ... (tip — n z — 1). In other words, for large K, the roots of the characteristic 
equation tend asymptotically to large multiples of the (n P — « z )th root of —1, shifted by a 
term sometimes referred to as the centroid of the poles and zeros. 

(3) When K is neither very small nor very large, the roots of the characteristic equation will 
take some intermediate values which may be real or complex. The purpose of the root 
locus method is to show graphically how the roots move between the open-loop poles and 
the open-loop zeros or the large complex values (asymptotes). 

The rules for plotting the root locus of an arbitrary closed-loop transfer function were 
originally proposed in a paper by Evans (1948). They are given in detail, though in slightly 
different forms and orders, in any comprehensive text on classical control theory (e.g. Dorf 
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and Bishop (2008), who use a slightly different convention for poles and zeros, or Van de 
Vegte (1993)) and the reader is referred to such a text. A summary of the rules, with minor 
differences from the approach presented here, is given on the RoyMech website, along 
with a number of examples. The rules are summarized here (without further proof) to act 
as a starting point, noting that this version of the rules does not cover all contingencies. 

(1) The root locus plot is symmetric about the real axis, because complex roots occur in 
conjugate pairs. 

(2) The number of paths on the root locus is equal to the number of open-loop poles n P 

(3) The paths start (for K = 0) at the open-loop poles and are normally marked on the 
complex plane with X symbols. 

(4) n z of these terminate at the open-loop zeros, which are marked on the complex plane 
with O symbols. 

(5) The remaining n P — n z paths do not terminate, but tend towards asymptotes oriented at 
angles 0 to the real axis where 

(2 k + 1)tt 

0 = — where k = 0, 1 ... (n P - n z - 1) (5.167) 

«p «z 

so that (for example) if there are five poles and two zeros there will be three asymptotes 
oriented at tt/3, tt and 5 tt/ 3 radians (60°, 180° and 300°) to the real axis (Figure 5.86a). 
Similarly, if there are five poles and one zero there will be four asymptotes oriented at tt/4, 
3tt/4, 5tt/ 4 and 7 tt/ 4 radians (45°, 135°, 225° and 315°) to the real axis (Figure 5.86b). 
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Figure 5.86 Typical asymptotes for (a) n P = 5,n z = 2 and (b) n P = 5, n z = 1 


(6) The asymptotes radiate outwards from the point on the real axis sometimes known as the 
centroid of the poles and zeros: 


Pi + Pi • • • + p np ~ zi + z 2 ... - z„ z 
k p - n z 


(5.168) 


(7) Root loci can be drawn on the real axis in the regions to the left of an odd number of 
poles and zeros. If there are an even number of poles and/ or zeros on the real axis to the 
right of a given point, no root locus is present at that point. 

(8) Where paths come together along the real axis (or converge back onto the real axis), the 
values of s at the ‘breakout’ and ‘breakin’ points (collectively known as ‘singular’ points, 
where there are multiple roots at a single point) are given by rearranging equation (5.158) 
to give K in terms of s then solving the equation: 

fr = 0 (5.169) 


Note that not all solutions to equation (5.169) will necessarily correspond to breakaway 
points. 
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(9) Where a path starts at a complex pole, the angle at which it leaves the pole, plus the sum 
of each of the angles to the real axis (horizontal) direction of the present pole from each 
of the other poles, add up to an angle ( 2k + 1)tt radians where k is a real integer (in other 
words, add up to an odd multiple of Tr).This is best illustrated via the example shown in 
Figure 5.87. 



Figure 5.87 Finding the angle of departure from a complex pole 


The angles of arrival of the paths at complex zeros are found by a process that is effectively 
the opposite of the above (the angle of arrival at a zero, plus the angles from the other 
zeros, minus the angles from the poles adds up to an odd multiple of tt). 


(10) The loci cross the imaginary axes at points that can be determined by replacing s by j'w 
and solving. It may or may not first be necessary to find the limiting value of K for stability 
(for example, by applying the Routh— Hurwitz stability criterion) before solving to find the 
imaginary roots. This can be a litde tedious so will not be explored in detail here, though 
the worked example includes a simple illustration. 

(11) Once we have plotted the root locus, the value of K can be chosen to ensure that the roots 
stay well away from the right-hand part of the complex plane. If a given point on the root 
locus is chosen, the corresponding value of K can be found by the product of distances of 
that point from the poles divided by the product of distances of that point from the zeros: 


\s~Pi\ X \s-p2\X...X\s-p nf \ 

k-*il x k-* 2 | x-x 


(5.170) 


If there are no zeros, the denominator is unity. 

With the availability of computing packages such as MATLAB, root locus plots can be 
drawn automatically and accurately without the need to apply the rules given above. An 
online root locus plotting Java applet is available (see References). It is nonetheless useful 
to be able to obtain a graphical visualization of a system’s behaviour without the need for 
computational effort. 

The power of the root locus method lies in the fact that an experienced control 
system designer can choose a suitable point on the root locus plot that will give a good 
compromise between stability and responsiveness; the corresponding value of K can then 
be calculated from rule 11 above. 
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Worked example 

Plot the root locus of a closed-loop system having the following open-loop 
transfer function: 


GWH(,) = ( 8 , + 2 ,f 2) (, + 2) (5.171) 

This can be expressed as 

G « H « = fr+ l+;)(i+l -;)(, + 2) < 5 ' 172 > 


which has three poles at s = (—1 + j), s = (—1 — j) and s = — 2, and no zeros. 
Following the rules above: 


■ (1) Symmetry about the real axis will be assumed and is satisfied by the open-loop poles. 

I (2) There are three open-loop poles so there are three paths on the root locus plot. 

I (3) The paths start at the open-loop poles 5 = (— 1 +j), s = (— 1 — j) and s = —2, 
which are marked on the diagram with X symbols (Figure 5.88(a)) 




This departure angle which 
we are trying to find ... 


... plus these angles which 
can be found from geomet 


. . . minus the angles from any\ 
zeros (none in this case) . . . 

. . . add up to it radians (could 
have been 3tt or 5tt . . .) ... 

... so the departure angle 
must be n/4 

o 






Figure 5.88 (a) Location of poles; (b) asymptotes; (c) angle of departure of path from pole 
(-1 + j); (d) completing the root locus plot including intersections with imaginary axis 
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(4) There are no zeros to be marked on the plot, so all of the paths end in asymptotes 
rather than at defined positions. 

(5) k p = 3, n./ = 0, so there are three asymptotes at angles tt/3, tt and 577/3 radians (or 
60°, 180° and 300°) to the real axis (Figure 5.88(b)) 

(6) The asymptotes radiate outwards from the point: 


_ (—1 +i) + ("I ~ j) + (~2) _ 4 

5 3 3 


(5.173) 


as shown in Figure 5.88(b). 

(7) The only point at which a path can be present on the real axis is to the left of the 
root s = —2, because only in this region is there an odd number of poles and zeros 
(in this case, only one pole and no zeros) to its right on the real axis. By enforcing 
this rule it is seen that this particular path starts at the pole s = —2 and extends 
infinitely along the negative real axis (Figure 5.88(c)). 

(8) There is no breakout or breakin as the paths in this case either lie on the real axis or 
away from it. 

(9) The angle of departure of from the pole s = (—1 + j) (measured relative to the real 
axis), plus the total of the angles of that pole from the other poles, minus the total of 
the angles to that pole from the zeros, must add up to an odd multiple of tt. It can 
be seen from Figure 5.88(c) how this is applied in the present case, with the total of 
the angles simply being tt itself. The other angles are (in this example) simple to find 
geometrically, so the angle of departure is found as 


TT TT TT 

"“T - 2 - 4 

By symmetry about the real axis, the angle of departure from the pole s = (—1 — j) 
must be — tt/4. 


(10) Two of the paths cross the imaginary axis after leaving the complex poles. At this 
stage it is noted that (by inserting the open-loop transfer function into (5.47), 
expanding the bracketed terms and rearranging) the characteristic equation of the 
closed-loop system can be expressed as 

s 3 + 4s 2 + 6s + 4 + K = 0 (5.174) 

Making the substitution s = jut gives 

-jw 3 - 4w 2 + 6jut + 4 + K = 0 (5.175) 

Taking the real part of equation (5.175) and rearranging gives 

6w = m 3 (5.176) 

(j) is therefore either zero (not a valid solution) or ±V6 rad/s, so the loci intersect the 
imaginary axis at the points s = ±V(>j = ±2.44/. Now taking the imaginary part of 
equation (5.175), rearranging and inserting this value of u> gives the value of gain for 
limiting stability: 

K = 4w 2 — 4 = 4X6 — 4 = 20 (5.177) 


At this stage, the root locus plot can be completed as shown in Figure 5.88(d), 
with the two complex paths leaving the two complex poles at ±tt/4, crossing the 
imaginary axis at ±2.44/, and converging on the asymptotes at tt/3 and 5tt/3. A 
control engineer may then choose (by eye and experience) a point on the plot for 
which the gain can then be calculated from the distances of that point to the poles 
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and zeros. For example, if point (—0.5 + 1.65/) on the locus is chosen by eye (Figure 
5.89), K is calculated as 

product of all distances from chosen 5 to poles 
product of all distances from chosen s to zeros 

|j + 1 - j | X |j + 1 +j\ X |s + 2 | |0.5 + 0.65/| X |0.5 + 2.65j| X |1.5 + 1.65/| 

“ I m 1 

= 4.93 (5.178) 

noting that (in this case) the denominator is simply 1 as there are no zeros. 



Figure 5.89 Finding the gain of a system 
corresponding to a point on the root locus plot 


Applications of the root locus method 

The root locus method can be used to explore further two of the examples from earlier in the 
present unit, namely the position controller and the antenna system, with particular reference 
to the stability and oscillatory response of the systems. 


It has already been shown from Figure 5.50 that the open-loop transfer of the position 
controller system with a simple proportional amplifier is 


F P (i) _ K 3 K ru K s 
V D (s) Js 2 


(5.179) 


The root locus is very easy to plot (for varying amplifier gain) as we can immediately see: 


• there are two poles, both of value s = 0, so there are two paths; 

• as the poles are equal, they also form the breakout point; 

• there are no zeros, so there are 2 — 0 = 2 asymptotes; 

• the centroid of the poles is 5 = 0 so the asymptotes radiate from the origin at an angle 
±tt/ 2 to the real axis. 


Without further ado, the root locus plot can be drawn as shown in Figure 5.90, showing the 
poles of the closed-loop transfer function as the amplifier gain is increased. 

This illustrates that the root locus lies in the imaginary axis — in other words, the response is 
always oscillatory and is on the margin of stability, with any input resulting in a position that 
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oscillates at a frequency dependent upon the amplifier gain, with the oscillations 
never dying away. However, if as before the simple amplifier is replaced with 
a PID controller/amplifier with gain K c and proportional and integral time 
constants T\ and T D respectively (and this time ignoring any additional external 
load), the open-loop transfer function now becomes 

V D (s) Js 2 Js 3 

Once again, assuming, that T D = 0.5 second and 7) = 1 second, and assuming 
also the system parameters chosen in Section 5.11, the open-loop transfer function 
becomes: 

V P (s) K c K m KJ n (s 2 + 25 + 2) K c K m KJ n (s + 1 + j)(s + 1 - j) 
V D (s) ~ Js 3 " Js 3 

This system now has: 

• three poles, all at s = 0 


lmb)f 


2 - 

5 = 0 (2 roots) 


-2 -1 

-2 ; 

1 Re(5) 


Figure 5.90 Root locus plot 
for positioning system with 
proportional control 


• two zeros, at 5 = (—1 + j) and s = (— 1 — j) 

• two paths that arrive at the zeros at angles that can be found in a similar way to that for 
angle of departure from poles: sum of angles from zeros — sum of angles from poles comes 
to (2fe + 1 )tt for integer k, i.e. an odd multiples of tt. In the case of the root at 

s = (—1 — jf), try k = —1 so angle = — tt — (tt/2 — 3 X 3tt/4) = 3tt/ 4, and by symmetry 
the angle of arrival at the other complex zero is — 3 tt/ 4. It can also be shown that the angles 
of departure from the three coincident poles at the origin are tt and ±tt/ 3, though this is 
not obvious from the rules already described 


• one asymptote at angle Tt radians, radiating from the point 5 = 2. 

The root locus plot can now be drawn as shown in Figure 5.91. 

It is seen that, as the gain is increased, the roots move into the right-hand 
half of the plane and the system becomes unstable; however, for larger 
values of K the system becomes stable again as two of the closed-loop 
poles tend to the open-loop zeros in the left-hand half of the complex 
plane (indicating decaying oscillatory responses) and the third closed-loop 
pole tends to — oo, indicating a rapidly decaying monotonic response. 

This example shows the usefulness of being able to visualize what is 
going on within a system that may initially seem to exhibit puzzling or 
counterintuitive behaviour, in this case showing unstable behaviour at small 
gains and becoming stable for larger gains. The practical implication of this 
is shown in Figure 5.92, which shows the response for gains of 1 and 10 
respectively. 

The root locus plot can also be drawn for the radar antenna problem. 

The block diagram in Figure 5.64 may be rearranged slightly by placing 
the common factor of K t inside the feedback loop as shown in 
Figure 5.93. 



Figure 5.91 Root locus plot for positioning 
system with PID control 


After multiplying out the terms on the forward path, and substituting for proportional plus 
integral action, the following open-loop transfer function is obtained: 


Open loop transfer function 


G c (s)K t K A K m 
RJs + Rb+ Kj 


K c ( 1 + ]/(T lS ))K t K ;i K m K c K t K ;i K nl (s + 1/7)) 
RJs + Rb + K m 2 - RJs{s + (Rb + Kj)/RJ) 
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Figure 5.93 Rearranged block diagram for complete 
antenna system 


This is now a second-order system with poles at s = 0 and s = — ( Rb + K m 2 )/RJ, and a zero 
at 5 = — 1/Tj. Using the same system constants as before, but with 7’, = 0.2 seconds (as this 
involves more interesting behaviour than other values which are arguably more suitable), the 
poles are now at 5 = 0 and 5 = —2, and the zero is at 5 = —5. The root locus plot shown in 
Figure 5.94 can he drawn. 
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This shows that, as the gain is increased from zero, the behaviour is initially overdamped, then 
becomes (slightly) underdamped, then becomes overdamped again. Such behaviour would not 
he obvious from examining the open-loop transfer function! The practical implication of this 
is shown via plots for K a = 0.1 (£ = 1.48), 4 (£ = 0.67) and 20 (£ = 1.1) as shown in Figure 
5.95. 



Learning summary 

By the end of this section you should have learnt: 

/ to draw root locus plots and interpret them in terms of system stability and response. 
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UNIT OVERVIEW 

■ Introduction 

■ Natural frequencies and mode shapes 

■ Response of damped single-degree-of-freedom systems 

■ Response of damped multi-degree-of-freedom systems 

■ Experimental modal analysis 

■ Approximate methods 

■ Vibration control techniques 


6.1 Introduction 

Many people encounter vibration for the first time through the music produced by stringed 
and percussive instruments, or through the speakers of their radios or the earphones of their 
iPods. All of these produce their sounds by vibration. While these provide pleasure to the user, 
vibration in most engineering structures is something to avoid. 

This unit will describe some basic causes and effects of vibration and show you how to model 
the vibration of structures mathematically. 

A recurring theme is resonance. A key observation is that if the frequency of an alternating 
force coincides with a resonant frequency, large amplitude vibrations will result. 

Large alternating displacements often produce large alternating stresses. 

It is no surprise, therefore, that resonance-induced fatigue is a major 
cause of in-service failures — normally when the effects of vibration 
have not been considered adequately at the design stage. 

In design, it is common to calculate the stresses due to static loads and 
to use a so-called ‘factor of safety’ to make allowance for effects that 
have been excluded from the analysis. An alternating force that induces 
resonance can easily produce stresses that are 100 times those that a 
static force of the same magnitude would give. Typical factors of ‘safety’ 
range from 1.5 to 4! 

The most dramatic failure caused by resonant vibration was probably 
that of the Tacoma Narrows Bridge in New York State. It collapsed in 
truly spectacular fashion in 1940, after only four months’ use. From its 
opening, bending waves were set up if the wind speed rose and people 
would drive across just for the fun of the experience (Disney World 



Figure 6.1 Wind-induced torsional vibration of 
the Tacoma Narrows bridge shortly before it 
collapsed 
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didn’t exist in 1940!). However, on 7th November a wind speed of 42 mph produced a severe 
torsional vibration. At its most extreme, the sides of the bridge were moving up and down by 
over 8 m. Take a look at the video at www.tinyurl.com/2ca5mh. Seeing is believing. 

Designers now understand about the vertical bending and torsional vibration that can be 
caused by wind or by loads moving across bridges. Or at least they thought they did, until the 
opening of the Millennium Bridge in London showed that there was still more to learn. It 
was built for pedestrians and cyclists, and the designers had taken account of the vertical forces 
from their feet as they walked across. A big crowd had gathered for the opening ceremony and 
when they started to walk across, the bridge began to sway from side-to-side. No structure 
is completely rigid and the crowd found that the bridge swayed slightly from side to side. To 
help steady themselves, they moved their feet apart as they walked and in doing so exerted 
a small alternating horizontal force onto the bridge with each step. The problem was that 
everyone responded in the same way at the same time and the combined effect of the forces 
from all the feet was sufficient to excite a horizontal bending mode of the bridge. Take a 
look at the video at www.tinyurl.com/33xeyle.The problem was cured by a combination of 
viscous dampers and tuned-mass absorbers. 

Section 6.2 explains how to calculate the resonant frequencies of 
structures and see how structures deform when they vibrate; their 
so-called ‘mode shapes’. Structures only vibrate if they are subjected 
to some form of excitation and Section 6.3 considers the response 
of structures that exhibit only one resonant frequency. Section 6.4 
extends this to structures with many resonant frequencies. Even the 
most sophisticated computer analysis has to make assumptions and 
simplifications that may not reflect the actual properties of the real 
structure. Section 6.5 describes how a structure’s resonant frequencies 
and mode shapes can be measured experimentally. At the preliminary 
stage of a design, approximate values can help determine whether 
vibration is likely to be an issue, and Section 6.6 introduces some 
useful techniques for obtaining these. Finally, Section 6.7 introduces 
two common techniques for controlling the potentially damaging 
effects of vibration. 


6.2 Natural frequencies and mode shapes 

Introduction 

Anyone who has tapped a bottle or cup will know that it ‘rings’ at a characteristic frequency. 
We hear a sound because the tapping has excited one or more natural frequencies. As noted in 
Section 6.1, resonance-induced fatigue is a major cause of premature structural failure, and this 
section discusses several ways of modelling structures so that their natural frequencies can be 
calculated. 

Before doing so, however, we introduce the classification of structure in terms of degrees of 
freedom. These can be thought of as ‘possible motions’. The number of degrees of freedom for 
a structure: 

• is related to the number of resonant frequencies it has; 

• affects the number of motion coordinates needed to define its position. 

A rigid body can have up to six degrees of freedom: three translations and three rotations. In 
addition to the six rigid body freedoms, a flexible body can have other possible motions, such 
as bending and twisting, or flexibility between components. 



Figure 6.2 The crowd of pedestrians who had 
gathered for the opening of the Millennium 
Bridge in London caused a pronounced side-to- 
side vibration when they began to walk across 
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A key step is to decide how many degrees of freedom are appropriate. With a few exceptions 
(such as a spacecraft), structures are constrained so that their freedom to move is limited. Even 
so, this can still leave perhaps thousands of possible ways in which a structure might vibrate 
(its so-called modes of vibration). In most cases, we are only interested in a few of these 
(possibly in only one mode) and we will look for a mathematical model that concentrates on 
these and omits other possible motions. If, for example, we wanted to study the way an aircraft’s 
wing vibrates when it hits an air pocket, we would ignore the fact that this also caused a light 
fitting in one of the toilets to vibrate! On the other hand, if we were designing the light fitting, 
we would want a mathematical model that described how it responded to motion of the panel 
that it was attached to. In this case, we would probably just model the fitting and the panel, but 
wouldn’t include every other part of the aircraft. 

In summary, we can say that we aim to model the particular motions we are interested in. We 
will start by looking at structures that can be characterized by just one type of motion. We will 
then move on to those that can exhibit two or more types of motion. 

The mathematical descriptions we will develop relate the motion of the structure to the forces 
that cause it. Central to this process is Newton’s second law of motion. This links cause 
and effect. It applies only to rigid bodies and allows us to relate the applied forces (and/ or 
moments) to the motion that they produce. 

There are two forms: one for translation, one for rotation. These were covered in Unit 6 in 
Part 1 and you may want to review that unit before proceeding. 

For linear translational motion, Newton’s second law can be written as: 


Resultant force in the direction of _ ^ Absolute acceleration of 

the acceleration the centre of mass 

It’s best to think of it in words, not as ‘F = ma’. In problems, you should write it down as 
it’s said, noting always that it’s a vector equation. Put the forces (resolved into the vector 
direction that you have chosen for acceleration) on the left of the equals sign and the mass and 
acceleration terms on the right. Don’t mix the two together. ‘Mass X acceleration’ is not itself a 
force. It describes the effect that the actual forces have on the mass in question. 

Newton’s second law for rotational motion can he expressed as: 


Resultant torque (moment) Moment of Absolute 

about the axis in the = inertia about X angular 

direction of the acceleration the axis acceleration 

It is important to note that ‘the axis’ about which the torque is taken must he one of the 
following. 

• a fixed axis 

• an ‘instantaneous centre of rotation’ (this is a point on a body that is stationary for an instant, 
such as the point of contact of a tyre rolling along the road) 

• the centre of mass. 

No other axis is allowed. 

As with the linear equation, think of the words and write it down as it’s said. 

Structures with one dominant mode of vibration 

Initially, we will look at structures that can be modelled with just one degree of freedom. This is 
the simplest model for a vibrating structure. The model normally takes the form of a rigid body 
restrained by one or more massless springs. It works well for structures with one resonance or 
where one resonance dominates the vibration behaviour. It gives good insight into vibration 
behaviour and is often used as a first approximation for more complicated structures. 
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We will look at a number of different examples where single-degree-of-ffeedom dynamic 
models are appropriate, but the general approach to the analysis will be the same in all cases and 
will involve the following steps: 

(1) Convert the physical structure into a dynamic mass-spring model. 

(2) Draw a free-body diagram. Displacing the body from its equilibrium position will create 
forces in the restraining springs that will try to return the body to equilibrium. The free- 
body diagram can be thought of as a ‘snapshot’ of the state of the system when it has 
moved away from equilibrium by a chosen amount. Drawing the free-body diagram is the 
key step in the analysis and should be tackled systematically. There are three stages: 

(i) Start with the system in equilibrium and draw it as a free body. To create the ‘free’ 
body, draw it without any of the restraining springs. The forces exerted by the springs 
will be added in stage (iii). 

(ii) Select a motion coordinate to describe how the system will deflect from its 
equilibrium position and mark it on the diagram. 

(iii) Apply a positive deflection in the chosen motion coordinate, identify the forces (and/ 
or moments) that result and draw them on the diagram. It is critical that the positive 
directions of the forces due to a positive deflection are shown correctly. 

(3) Apply the appropriate form of Newton’s second law of motion to give the equation of 
motion for the system. 

The following pages show how steps (1), (2) and (3) can be applied to a number of examples. 
You should adopt the same systematic approach using the three steps when setting up solutions. 

At this stage, we will concentrate on finding the natural frequency of the systems. This is the 
frequency at which a system will vibrate when displaced from equilibrium and then released. 
Later, we will consider the effects of external excitation and damping, but these are omitted for 
the moment. 

When we look at the effects of excitation, we will find that, for most engineering structures, 
there is a maximum response if the excitation frequency coincides with the natural frequency 
(which is why we need to know what its value is). This effect is called resonance and the term 
resonant frequency is often used instead of natural frequency (although, strictly speaking, the 
two are different as we shall see later). 


Example 1: Simple mass-spring system 

If a mass m [kg] is suspended from a spring of stiffness k [N/m], it 
will move down under the effect of gravity and stretch the spring 
by a distance x cq before reaching its static equilibrium position 
(Figure 6.3). 

Once it is in equilibrium, the force in the spring (given by the 
stiffness multiplied by the change of length) exactly balances the 
gravitational force on the mass and hence: 

mg = kx eq 

What then happens if the mass is given a further downward 
displacement, x, away from equilibrium and then released (Figure 6.4)? 

The displacement x produces an additional force in the spring of kx, 
as shown in Figure 6.4(b). Since we know from the static 
equilibrium case that mg = kx eq , the resultant force on the mass is kx, 
as shown in Figure 6.4(c). 

Remembering that the displacement and the force are both vectors, 
we see that a positive downward displacement from equilibrium 
produces a positive upward resultant force on the mass. This acts to 


Unstrained 

position 


Figure 6.3 
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return it to its equilibrium position. At any instant, when x was negative (meaning that the 
mass was above its equilibrium position), the ‘upward’ resultant force, kx, would also be negative, 
telling us that the force was actually acting downwards at this instant. Since the force always acts 
to return the mass to equilibrium, the term restoring force is often used. 

So, what happens when the mass is displaced downwards (x positive) away from equilibrium 
and then released? Initially, there will be an upward resultant force on the mass, so it will 
accelerate upwards. Since downward movement has been chosen as positive, upward velocity 
and upward acceleration are both negative. When the mass reaches its equilibrium position 
(point A in Figure 6.5), x is zero and the resultant force and acceleration are zero as well. At 
this point, the velocity has its maximum upward (i.e. negative) value. This upward velocity 
carries the mass past this point and it moves above the equilibrium position. So x now becomes 
negative and the resultant force then acts downwards and slows the mass down (acceleration is 
downwards). At point B, the mass reaches its maximum upward (i.e. negative) displacement and 
is instantaneously at rest. Since x is still negative, the resultant force continues to act downwards. 
The mass passes back through the equilibrium position (point C), when it has its maximum 
downward (i.e. positive) velocity. Then x becomes positive once more and the upward resultant 
force slows the mass down until it reaches its original starting position (point D). 



Displacement 

Velocity 


We see that the resultant force on the mass depends only on the displacement measured from 
the equilibrium position. Here, and in all other problems, the static forces in springs exactly 
balance any gravitational forces on the mass under equilibrium conditions. Because they 
always cancel each other, we ignore them and start at the equilibrium position and consider 
displacements away from that position. 
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Note: Throughout this unit, we will assume that displacements are small and that the spring 
stiffnesses are constant. This means that the force in a spring varies linearly with displacement. 
Such systems are called linear systems. If displacements are not small, the stiffness may vary 
with displacement, but this is not considered here. 

To create a mathematical model of the problem, the first step is to replace the physical system 
(Figure 6.6) with a dynamic mass-spring model (Figure 6.7). In this simple case, the mass is 
represented by a square block and the coil spring by a zig-zag representation. 

Step 1 Dynamic mass— spring model 



Figure 6.6 Physical system 


Figure 6.7 

Step 2 Free-body diagram 

(i) Remove the spring to leave the mass by itself. 

(ii) Mark the chosen positive direction for displacement. 

(iii) Give the mass a positive displacement, write down the expression for the force and add it 
to the diagram to show its positive direction. 


kx 

D . 


Step 3 Equation of motion 

Applying Newton’s second law, we write down the resultant force in the chosen direction of 
motion (downwards in this example) and equate to the mass multiplied by the acceleration. Hence, 
-kx = mx 


This can be rearranged to give an equation in the form of a second-order ordinary differential 
equation with constant coefficients. 

mx + kx = 0 ( 6 . 1 ) 

Any sinusoidal function satisfies this equation, but from the earlier description of what happens 
to the mass when it is displaced downwards and then released, a suitable mathematical form 
would be x(t) = Xcos cut. This describes a sinusoidally varying displacement at frequency to, with 
maximum deflections of X above and below the equilibrium position.The frequency of the 
vibration is called the natural frequency and is the characteristic quantity that we are trying to 
find. Sinusoidal displacement like this is often referred to as simple harmonic motion. 
Differentiating x(t) = X cos bit, we get x = — coX sin wt and x = — ai 2 X cos bit. 

Substitute for x(t) and its derivatives into the equation of motion. 

— worX cos bit + kX cos at = 0 


Cancelling X cos bit. 


-DM 2 + k = 


0 


!~k 

The natural frequency for this system is therefore given by y — .The symbol, is normally 
used for the natural frequency. When substituting numerical values into the equation of 
motion, « must have the units of rad/ s to make the equation consistent. However, the value 
would normally be quoted (in a report, for example) using the units of Hz (hertz, or cycles per 
second). The two are linked by the equation: 

/n[Hz] = [rad/s] 
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You will find that other systems have different equations of motion, but provided that we can 
assume that displacements remain small, all will have the same form, namely: 

Mz + Kz = 0 

where z is the chosen motion coordinate. Following the above analysis, we find that the natural 
frequency is given by 

= [rad/s] 

Hence, as soon as you’ve derived the equation of motion and obtained the coefficients of the 
displacement and acceleration, you can immediately write down the expression for the natural 
frequency of a system. 

Example 2: Vertical vibration of a block on a flexible cantilever beam 


The system can be made to vibrate by lifting the block up (causing 
the supporting beam to bend) and then releasing it, resulting in an up 
and down vibration. The block will be treated as a rigid mass and the 
supporting beam as a massless spring. The bending stiffness of the 
3 EI 2 

beam is given by k a = Note that the symbol I 2 in this formula is 
the second moment of area for the beam. Since displacements are 
assumed to be small, we will neglect the rotation of the block caused 
by the bending of the beam and consider only the vertical translation 
of the mass. The example on page 402 shows one way of including the 
effect of the rotation in the analysis. 

Step 1 Dynamic model 



Figure 6.9 Physical system 


Li 








Bending stiffness, fe B 



Figure 6.10 


Dynamically, this case is identical to the first example and following the same procedure of 
free-body diagram and equation of motion, the natural frequency will be found to be 


0 )„ = 


(6.2) 


Example 3: Torsional vibration of a block on a cantilever beam 

The block and beam from the previous example can also be made to vibrate by rotating the 
block about the axis of the support beam and then releasing it, resulting in torsional vibration. 
In this case, the support beam can be modelled as a spring with torsional stiffness 

CJ 

k-y = —j~ (with units of Nm/rad).The block can again be modelled as a rigid body, but with a 

moment of inertia I about the beam axis, which we will assume to be a fixed axis. Methods of 
calculating the moment of inertia of rigid bodies are covered in Unit 6 of Part 1. 
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Step 1 Dynamic model 



Thumb points in 
the vector direction 


Figure 6.11 


It is convenient to use rotational vector notation in this example. As shown in Figure 6.11, the 
angular displacement vector acts along the axis of rotation in a direction given by the ‘right- 
hand rule’. 

Step 2 Free-body diagram 

If the block rotates by an angle 0, the induced torque in the support beam will be fe x 0. 9"" 

This will act in a direction to oppose the imposed rotation and to return the block to 
its equilibrium position. 


Step 3 Equation of motion pig ( 

Applying Newton s second law for rotation, we write down the resultant torque in the 
chosen direction of motion (to the left in this example) and equate to the moment of inertia 
multiplied by the angular acceleration: 


From the coefficients in the equation of motion, the natural frequency for this system is: 


Examples Rocker system 

This example consists of a rigid, massless bar with a fixed pivot at one end and a large mass 
attached at the other. The rocking motion about the pivot is restrained by two springs, one 
attached to the mass and the other that is connected to the bar at point B. 

Note: We will use the angular displacement of the bar about the fixed pivot as the motion 
coordinate and assume that this displacement is small. This means that cos 0 ~ 1 and 
sin 0 ~ tan 0 ~ 0. Taken together with the earlier assumption that stiffness values are constant, 
the linear dependence of the spring forces on the motion coordinate, 0, will be maintained. As 
a result, the model will give an equation of motion that is a second order ordinary differential 
equation with constant coefficients. 


Rearranging, 


/0 + fe T 0 = 0 



(6.3) 
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Step 1 Dynamic model 


Step 2 Free-body diagram 



K 2 L 2 0 



Kj Lj 0 


Figure 6.14 

For a positive rotation of the bar, point B will move up by L, 0.This puts the spring K x into 
tension, resulting in a downward force on the bar. Point C will move up by 1^0, putting the 
spring K 2 into compression and resulting in a downward force on the bar. 

Step 3 Equation of motion 

Taking anti-clockwise moments of the forces about the fixed pivot at A, noting that the 
moment of inertia of the mass about the pivot is mL 2 2 , we get: 

-K x L t Q XL, - K 2 LoQ X 4 = mL 2 2 0 

Rearranging, 

mL 2 2 0 + (Lf, 4 2 + K 2 L 2 2 )Q = 0 

From the coefficients of 0 and 0, the expression for the natural frequency is: 


Ik x l x 2 + k 2 l 2 2 

v 


(6.4) 


Lumped mass-spring systems 

Many structures exhibit more than one mode of vibration, each with a different frequency. For 
example, the Tacoma Narrows bridge had become known for its bending vibration almost from 
the day it opened, but it was a torsional vibration excited by a particular speed of cross wind 
that cause its collapse. 

Some structures can be approximated by several rigid bodies connected by massless springs. The 
analysis approach is an extension of the one used for single-degree-of-freedom systems, except 
that each of the rigid elements will have a separate free-body diagram. Each will also have its 
own equation of motion. 
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Example 1 : Single-axle caravan 

In this example, the body of the caravan is assumed to behave like a rigid mass and is connected 
to the axle (also assumed to be a rigid mass) by road springs in the suspension. The axle is 
separated from the road by flexible tyres. The mass of the tyres and the suspension springs 
is assumed to be negligible in comparison with the body and axle parts. A suitable dynamic 
model for studying vertical vibration is shown below. 



Caravan body (sprung mass) 
Road springs 

Axle and wheels (unsprung mass) 

Tyres 

Road 


Figure 6.15 



Since the body and the axle can move separately from each other, this model has two degrees 
of freedom (two independent possible motions). Two coordinates are needed to describe how 
the system moves: axle displacement and body displacement. 

We will see that the model will predict two natural frequencies, each of which will have a 
characteristic pattern of displacement, called a mode shape. 


Step 1 Dynamic mass— spring model 



Figure 6.16 


Step 2 Free-body diagrams 

Upward (positive) displacement of mass m\ will put the spring k\ into tension, resulting in a 
downward force on the underside of the mass. The force in spring k 2 is given by the stiffness 
multiplied by the change of length due to the displacements X\ and x 2 . Since both have been 
chosen to be positive upwards, the net change of length is the difference between them, that is 
(x\ — x 2 ). The force in the spring is therefore k 2 (x\ — x 2 ). This is positive whenever X\ > x 2 , 
that is, whenever the spring is in compression. The compressed spring will exert forces in the 
direction shown in Figure 6.17. 
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EF 

Ml - X 2 ) 


k 2 (x 1 - X 2) 



Mi 


Figure 6.17 


Mi -*2) 



Mi - *2) 


fci 



Mi -^ 2 ) 



Mr*2> 


We could have written the net change of length of the spring as ( x 2 ~ x\) to give the force 
expression as k 2 (x 2 ~ X\). This is positive whenever x t < x 2 , or whenever the spring is in 
tension. With the spring in tension, the directions of the forces are reversed. Figure 618 Case 1 Spring in 

The two cases are equivalent to each other, so both are correct. compression .v-, > x 2 


Note: The free-body diagram shows a ‘snapshot’ of the system when x 1 and x 2 are both 
positive. The arrows on the diagram need to show the positive direction of the forces that each 
spring exerts on the masses. The expression for each force is given by 


spring force = stiffness X change of length. 

Take each spring in turn and ask yourself the following questions: 

• What is the change of length of the spring? 

• Is the spring in tension or compression? 

For the change of length, look at the positive displacements of the two ends of the spring. If 
the direction of movement at both ends is the same (as it is in this example), the change of 
length will be the difference between the individual displacements. If, however, the chosen 
positive directions for the two ends of the spring are in opposite directions, the change of 
length will be the sum of the individual displacements. You need to be systematic when 
setting up the free-body diagrams. 



Figure 6.19 Case 2 Spring 
in tension x ^ < x 2 


Step 3 Equations of motion 
For mass m 1; 

“Ml “ Mi - *2) = Mi 


and for mass m 2 , 


+ k 2 {x\ - x 2 ) = m 2 x 2 
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and 

In matrix form, 


m\X\ + (k x + k 2 )x 1 — k 2 x 2 = 0 
m 2 x 2 — k 2 x\ + k 2 x 2 = 0 


[o' tIKMol 


(6.5) 

(6.6) 


[M]{x} + [K]{x} = {0} 

where [M] and [K] are called the mass and stiffness matrices for the system. 

As with single-degree-of-freedom systems, we can check for errors in the equations at this 
stage. In particular, 

(1) the terms in the leading diagonals of [M\ and [fC] are always positive; 

(2) the off-diagonal terms may be positive or negative; 

(3) [ M\ and [X] are often symmetric about the leading diagonal. 

The equations are coupled: each involves both x x and x 2 . Physically, the coupling tells us that 
if one mass moves, the other mass will also move; push the top mass down and the lower mass 
moves as well. Put another way, motion of one mass cannot occur independently of the other. 
Mathematically, the coupling means that the equations must be solved simultaneously. 

For free vibration of the system at one of its natural frequencies, the motion of each mass will 
be sinusoidal. Use as substitutions, x x (t) = X 1 cos tot and x 2 (t) = X 2 cos (at, where X, and X 1 
are the amplitudes of vibration of each mass. 

Substituting into equations (6.5) and (6.6) and cancelling cos (at , which is a common factor in 
all terms, gives 

-m x (a 2 X x + (k x + k 2 )X x -k 2 = 0 

and 

-m 2 (a 2 X 2 - k 2 X x + k 2 X 2 = 0 

or in matrix form 

(k x + kf) ~ mpa 2 - k 2 1 rXJ = mi (6.7) 

-k 2 k 2 ~ m 2 (a 2 \ 1X 2 J lOj (6.8) 

([X] - co 2 [M]){X} = {0} 


or just 


where 


[Z|{X} = {0} 


[Z\ = [K\- <a 2 [M\ 

Equation (6.9) is in the form of an eigenvalue problem; commonly presented in maths 
textbooks in the form: 


([A] - \[B]){X} = {0} 

In our vibration problem, the eigenvalues give the natural frequencies and the eigenvectors 
give the corresponding mode shapes. 

There are many methods of solving eigenvalue problems. In practice, most engineers will turn 
to software such as Matlab, which has the necessary functions. A Matlab script for doing this is 
provided on the book’s website: www.hodderplus.co.uk/mechanicalengineering. For problems 
with only two or three degrees of freedom, however, the following procedure can be used. 
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For a non-trivial solution of equation (6.9), 

det[Z] = 0 

Multiplying out the determinant in this case gives 

m 1 m 2 a> 4 — (m,h 2 + w 2 (fei + k 2 ))w 2 + ^k 2 = 0 (6.10) 

Equation (6.10) is called the Frequency equation. For this problem, it is a quadratic in to 2 
and will have two roots, oo nl 2 and «„ 2 2 , where co nl and (o n2 are the two natural frequencies of 
the system and this gives us the first part of the information we are looking for. 

To find the corresponding mode shapes, we substitute each root back into equation (6.7) or 
(6.8) to get the relationship between X 1 and X 2 . Since equations (6.7) and (6.8) form a pair of 
homogeneous equations, we cannot find unique solutions for Xj and X 2 separately. 

One way of resolving this is to make the amplitude of one mass equal to unity and then find 
the amplitude of the other mass relative to this. For example, if we choose X 2 = 1, equation 
(6.8) becomes: 

(k t + k 2 - w 2 m 1 )X 1 - k 2 ■ 1 =0 
Hence, ^ 



fXH 

The vector j is the required mode shape, which tells us the relative amplitude and phase 
of the two masses. 

Here is a numerical example for the caravan problem. Let m\ = 70 kg, m 2 = 350 kg, 
k\ = 400 N/m and k 2 = 40 N/m 

Natural frequencies 

Substituting into equation (6.10) and solving gives w wl 2 = 103.7 s -2 and co„ 2 2 = 6296.3 s -2 . 
Hence, (*)„! = 10.18 rad/s = 1.62 Hz and «„ 2 = 79.35 rad/s = 12.63 Hz 


Mode shapes 

Mode 1 

Put w nl 2 = 38.1 s -2 into equation (6.11) to give: 


{*:HTo 2 } 

In this mode, the axle and the body vibrate in phase with each other (Xj and 
X 2 are both positive values) and the amplitude of the displacement of the body 
is about 10 times that of the axle. This is because the tyres are relatively stiff 
compared with the springs in the suspension. It’s a low frequency vibration 
(1.62 Hz) and is commonly called the ‘bounce’ mode of the vehicle. A Matlab 
script is included on the book’s website; that shows an animated version of the 
mode shapes, and Figure 6.20 is a still from the animation showing the positions of 
the two masses at the instant that mass m 2 is at its uppermost displaced position. 

Since the axle movement is small compared with that of the axle in this mode, 
we could have made the assumption that the tyres were effectively rigid, 
implying that the axle didn’t move. This would have given a single-degree-of- 
freedom system with the body mass supported only by the suspension springs. 

The natural frequency for this system is 1.70 Hz, which is similar to the figure of 
1.62 Hz obtained above. 
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Mode 2 

Put «„ 2 2 = 261.8 s“ 2 into 


equation (6.11) to give: 


P 1 ! = p 54 - 09 ! 

1*2 J 1 1.0 1 


In this mode, the axle and the body vibrate out of phase with each other (Xj and 
X 2 have opposite signs), and the amplitude of the displacement of the axle is over 
50 times that of the body. It’s called a ‘wheel-hop’ mode and is at a much higher 
frequency (12.6 Hz) than the bounce mode. 


Example 2: 2D vehicle model (coupled bounce and pitch) 

The previous worked example showed that the bounce mode of the caravan 
could be estimated by assuming that the tyres were rigid. The second example 
applies this observation to a four-wheeled vehicle to study the combined effects 
of bounce and pitch. 


Step 1 The following assumptions will be made in developing the dynamic model: 


(1) Roll motion is not considered — pitch and vertical translation only. 

(2) The body is rigid, with mass, m, and moment of inertia, I G about its centre of mass. 

(3) The tyres are very stiff, so that the axles do not move. 

(4) k A and k n are the combined stiffnesses for the front and rear springs respectively. 

(5) The shock absorbers are ignored. 



WI-MUH 


mu 





Equilibrium 

position 


Figure 6.22 

A and B (the attachment points for the suspension springs) along with the centre of mass, G, 
are marked on the dynamic model, which is shown in its equilibrium position. 
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We will use x G (displacement of G) together with 0 (pitch angle) as motion coordinates. These 
are convenient since they link directly with the two equations of motion for the body. The 
vertical translation equation will need the absolute acceleration of the centre of mass and the 
rotational equation will use G as the axis (there is no fixed axis on the car body) and we will 
need the angular acceleration of the body. 


Step 2 Free-body diagram 

To draw the free-body diagram, it has been assumed that x G and 0 are such that both springs 
increase in length. For small 0, the increases in spring lengths are: 


and 


x A = x G ~ *0 


x B = x G + 60 

With both springs in tension, the positive directions of the 
forces on the body are as shown in Figure 6.23. 

Step 3 Equations of motion 

For vertical translation of the centre of mass, 

~k A (x c - a0) - k B (x G + 60) = mx G 
For rotation about the centre of mass, 

+ k A (x G - 30) ■ 3 - k B (x G + 60) ■ b = Z G 0 G 

Rearranging, 


(6.12) 


x c 



Figure 6.23 


mx G + (k A + k B )x G + (bk B ~ ak A )Q = 0 


Z G 0 + (6fe b — 3fe A )x G + (a 2 k A + 3 2 fe B )0 — 0 

In matrix form, 

\m 0 ]\x G \ \k A +k B bk B — ak A 1J*g1 (01 

[ 0 h J l 0 J [bk B ~ ak A a 2 k A + b 2 k B \[ 0 J (0 J 

For a solution, put x G (t) = X G cos wf and 0(t) = 0 cos u>t. 

This gives, 

\k A + k B ~ mu 2 bk B - ak A lpCo! _ mi 
[ bk B — ak A a 2 k A + b 2 k B — Z G to 2 Jl 0j |o J 


(6.13) 


13 « = 


(6.14) 


To find the natural frequencies, solve the frequency equation given by det[Z] = 0. 

The mode shapes can be found by substituting the natural frequencies into either equation 
(6.13) or (6.14). Using equation (6.13) for example: 

(fe A + k B — mu 2 )^ + (bk B — akf)Q = 0 


f(3fe A - bk B )/(k A + k B ~ mb) 2 ) 
1 1.0 


Let 0 = 1 rad to give: 


10 


(6.15) 
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Worked example 


m = 900 kg 
k A = 25 kN/m 


I G = 1000 kg m 2 
k B = 10 kN/m 
b = 2m 


Natural frequencies 

The frequency equation becomes 

1 35 X 10 3 - 900m 2 -5 x 10 3 I _ 

-5 X 10 3 25 X 10 3 + 40 X 10 3 - 1000m 2 1 ~~ 0 

1 35 - 0.9co 2 -5 I _ 

or 1-5 65 — m 2 I — 0 

Expanding, 0.9m 4 — 93.5m 2 + 2250 = 0 

Roots are M nl 2 = 37.8 s -2 and m „ 2 2 = 66.0 s -2 


Hence, M nl = 0.98 Hz and m „ 2 = 1.29 Hz 


Mode shapes 

Mode 1: Put m „, 2 = 37.8 s -2 into equation (6.15) to give | q J = j^'^J ^ 

1X A 1 . 

To visualize the mode shape, j ^ 1 is more convenient. This ratio can be obtained by 
using equations (6.12). Thus, 1 n ' 

(X A | = (X G - = 1 5.43 1 XI ) 14.431 

(X B J [x G + fee J 15.43 + 2 X lj 17.43 J 

Normalizing to make X B = 1.0, we get: 

P A 1 = {°- 60 l 

|x B ) 1 1.0 J 



position 

Figure 6.24 


This ‘snapshot’ of the mode at its extreme position shows a predominant bounce motion, 
with a little pitching. 
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◄ 


Mode 2: Put w„ 2 2 = 66.0 s 2 into equation (6.15) to give 



-0.205' 
1.0 . 


rad 


or from equation (6.12), 




A 


X A = -0.67 


Figure 6.25 


In this case, we see mainly pitching motion, with only a small displacement at the centre 
of mass. 


If the car was driven over a road with a gradually undulating profile (such as produced by 
ground subsidence), the car will tend to bounce up and down with mode 1 dominating the 
response. Alternatively, if the front wheels hit an obstruction in the road (for example, the 
speed bumps used in traffic calming schemes in urban areas), it’s likely that a pitching response 
dominated by mode 2 would result. We will see in Section 6.4 that the response of a structure 
can be written as a combination of its modes of vibration, so in the general case, the response 
will be a mixture of modes 1 and 2. 


Most mechanical power is transmitted via rotating shafts. Examples include the turbine- 
alternator sets in power stations, the tail rotor drive in a helicopter and the propeller shaft in a 
ship’s propulsion system. One of the problems for the vibration engineer is that power surges 
or sudden changes in load can induce transient torsional oscillations that are superimposed on 
the uniform rotation. Also, the drive torque can sometimes contain fluctuating components 
that can set up steady state torsional oscillations in the system. In each case, the torsional 
natural frequencies and modes shapes are required to study the behaviour. 

Example: Main drive shaft of a gas turbine engine 

There are two drive shafts in the V2500 engine, which powers the majority of Airbus 320 
aircraft (and several others) currently in service. The main shaft transmits power from a 
five-stage turbine at the rear to the fan and the first three compressor stages. The whole 
assembly rotates freely, supported by three bearings. To study the torsional behaviour, the 
dynamic model will assume that the fan and turbine assemblies can be treated as rigid elements 
and that the mass of the shaft can be neglected. 


Torsional systems 
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Figure 6.26 V2500 twin-shaft aero engine 


Step 1 Dynamic mass— spring model 
Fan assembly 



Figure 6.27 

The engine rotates anti-clockwise when viewed from the front, so this has been chosen for the 
positive direction of rotation for the two motion coordinates, 0] and 0 2 . 

Step 2 Free-body diagrams 






An Introduction to Mechanical Engineering: Part 2 


If both motion coordinates are given a positive rotation, the twist in the shaft will he the 
difference between the two. Hence the expression for the torque in the shaft can be written 
as: fe(0i — 0 2 ). This is positive whenever 0j > 0 2 and results in the torque directions shown in 
Figure 6.28. 

Step 3 Equations of motion 

For the fan, — fe(0i — 0 2 ) = Fi0i 

For the turbine, +k(Q 1 — 0 2 ) = / 2 0 2 


Rearranging, 


iA 

+ fe0j - 

fe0 2 

, = 0 



and 


/ 2 0 2 

- fe0! + 

fe0 2 

1 = 0 



In matrix form, 









T 1 

°i(0 

l)[ k 

-k 

1N- 




[o 

4] 16; 

jl-k 

k 

JW 

( 0 ) 


Substitute Q x (t) = 

©i cos c of and 0 2 (f) = 

0 2 COS (tit 

to give 





- I x (ti 2 

-k 

L 


.{°1 

(6.16) 


i 

-k 

k - 1 2 (, 


le 2 J 

lot 

(6.17) 


Expanding the determinant of the coefficient matrix, the frequency equation is 

I X I 2 « 4 - Kk + hW = 0 


This has, as its roots, u> nl 2 = 0 and 


Kh + h) 
hh 


To find the mode shapes, substitute roots ii 
and putting 0 2 = 1, 

£}- 


o equation (6.16) 


| ( k ~ I 2 (ti 2 )/k) 

I 1.0 I 


(6.17). Using equation (6.17) 


(6.18) 


For mode 1, (ti nX = 0 and |q' J = {{}' This describes a rigid body mode, which in this case 

tells us that the rotor is able to rotate continuously in one direction, with no twisting of the 
shaft (0i = 0 2 ). 


In all cases, a rigid body mode is characterized by a natural frequency equal to zero and by a 
mode shape in which there is no deformation of any of the parts; in other words, the system 
moves as if it were a single rigid body. 


Any structure that is capable of moving without deformation (this is true of any structure not 
connected to the ground) will have one (or more) rigid body modes with «„ = 0. It follows 
that the frequency equation will not contain a constant term. Since you can tell in advance 
that this should be the case, it’s a useful check that the frequency equation is correct. 


For mode 2, 


IHh + h) 


{ell 


ra 


This mode has a non-zero frequency and the fan and turbine vibrate 180° 
each other, resulting in significant alternating twisting of the shaft. 


of phase with 


Shaft whirl and critical speeds 

Shaft whirl is a potentially destructive, self-sustaining flexural vibration observed in rotating 
shafts. It occurs if the rotational frequency of the shaft coincides with a resonant frequency for 
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flexural vibration. These shaft speeds are called critical speeds. A given shaft will be designed 
to operate with some maximum speed. Ideally, if this maximum design speed is less than the 
lowest critical speed, whirl will not be a problem. Unfortunately, this is not always possible 
and it is vital to be able to calculate what the critical speeds will be. For example, the need 
to minimize the mass of the drive shafts in an aero engine means that they may have critical 
speeds within the operating range. 

To examine the characteristics of shaft whirl behaviour, we will model the shaft as a ‘beam’ 
with a circular cross section in order to find its natural frequencies in flexure. The analysis 
below shows that shafts potentially have an infinite number of flexural natural frequencies, 
which means that they have an infinite number of critical speeds. 


Flexural vibration of uniform beams 

Apart from shafts, many structures exhibit beam-like vibrational behaviour. Examples include 
aircraft wings, helicopter rotor blades and suspension bridge decks (all of which can vibrate 
in response to aerodynamic interaction) and tall buildings, which vibrate significantly during 
earthquakes. While these are more complex than uniform beams, they exhibit many of the 
same characteristics and this section will provide some insight into this behaviour. 



bx 


y 


Figure 6.29 


Unlike previous cases, a beam does not consist of discrete masses connected by massless 
springs. Both mass and stiffness are distributed along the length. A different approach is 
required and we start by considering the free-body diagram of an infinitesimal element of the 
beam of length 8x. 



Sign conventions 


Figure 6.30 


The equations derived and used in this section depend on the use of a standard sign 
convention (shown in Figure 6.30) that defines the positive directions of the transverse 
displacement, y(x, t), and of the shear force and bending moment. It is the same sign 
convention as is used in Unit 3. 


Bending moment — curvature relationship is 


, d 2 y 


(6.19) 
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Equation for vertical motion (downward motion is positive): 
S 


s + t? 8 *) = (p^ 8 *)§i 


J 2 y 


(6.20) 


To simplify the analysis, we will neglect the rotational inertia of the element and the shear 
deformation of planes of cross section. Both assumptions tend to give an over-estimate of the 
natural frequencies of the beam. While this error is normally small for the first few modes, it 
increases progressively when higher frequencies are evaluated. 

The equation for rotational motion about an axis through the centre of mass of the element is: 


c- 

8 dx 


( 6 . 21 ) 


Substituting for M from (6.19) 


equation (6.21) and then for S in equation (6.20) we get 


d 4 y 




EI ^- 


(6.22) 


This is the governing differential equation for the free vibration of the beam. 

Equation (6.22) is a partial differential equation giving the deflection, y, which is a function of 
space x and time t. The objective in solving the equation will be to find the natural frequencies 
and the corresponding mode shapes. 

For free vibration at a natural frequency, the displacement of any point on the beam in the 
y-direction will be sinusoidal, but the amplitude of the vibration will vary along the length. 

We can therefore use as a substitution, 

y(x, t) = Y(x) cos (at 


Substituting into (6.22), we get 


d 4 y 

El cos (at = pA or Y(x) cos (at 


d 4 Y 
dx 4 


For a uniform cross section, A and I ai 
called wavenumber, X, defined by 


-m 

constant and it is convenient ti 

i4_ 


introduce the so- 


(6.23) 


to give 

Take as solution: 
Thus, 


d 4 y 

dx 4 


= \ 4 Y{x) 


Y(x) = Ae ax 


a 4 Ae ax = X 4 Ae ax 


a 4 = \ 4 


so that 


a = ±\ or ±iX 
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The complete solution for Y(x) is therefore 

Y(x) = Af e^* + A 2 e~^ x + A 3 e^ x + A 4 e~^ x 
which may be rewritten to give the more convenient form, 

Y(x) = Ci sin Xx + C 2 cos Xx + C 3 sinh Xx + C 4 cosh Xx (6.24) 

This is a general equation giving the deflected shape of any beam of uniform cross section. 
The constants Q — C 4 need to be determined from the boundary conditions at the ends of 
the beam. Four basic types of support are shown in Table 6.1 and other types of boundary 
conditions are considered later. 



Table 6.1 Boundary conditions for basic types of support 

The following steps should be used to find the solutions for particular cases: 

Step 1 Start by identifying the four boundary conditions. Use y(x, t) = Y(x) cos (at, with 

equation (6.24) to express the boundary condition in terms of Y(x) and its derivatives. 

Step 2 Since each of the boundary condition equations depends on Q — C 4 , they can be 
assembled in the form 

[Z]{C} = {0} (6.25) 

where { C} is a vector of the constants C 4 — C 4 and [Z] is a coefficient matrix. 

Step 3 For a valid solution, det [Z] = 0. 

This gives the frequency equation and its roots will give the natural frequencies of 
the beam. 
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Step 4 When each root is substituted back into equation (6.25), the solution vector {C} will 
define the corresponding mode shape when the values are put into equation (6.24). 


Pinned-pinned 

sin XL = 0 

Clamped-clamped & free-free* 

cos XL cosh XL - 1 =0 

Clamped-pinned & free-pinned* 

tan XL - tanh XL = 0 

Clamped-free 

cos XL cosh XL +1=0 


Table 6.2 Frequency equation for particular end conditions 


4 5 


Pinned-pinned 

IT 

2tt 

3tt 

IT 

5tt 

l"7T 

Clamped-clamped 
& free-free* 

4.730 

7.853 

10.996 

14.137 

17.279 

~ (r + 0.5) tt 

Clamped-pinned & 
free-pinned* 

3.927 

7.096 

10.210 

13.351 

16.493 

« [r + 0.25) -it 

Clamped-free 

1.875 

4.694 

7.855 

10.996 

14.137 

« (r- 0.5) it 


Table 6.3 Numerical values of roots, X r L, of frequency equations. 


A free-free beam will also have two rigid body modes and a free-pinned beam o 
rigid body mode, each corresponding to XL = 0. 

Selecting the values of X r L from Table 6.3 for the beam of interest, the natural 

(X r L\ 2 [El 

frequencies can be found from equation (6.23). That is: co r = I —j— I j 


Example 1: Simply supported beam 


Step 1 

The boundary conditions at x = 0 and at x = L are: 

d 2 y 

y = 0 and M = 0, the latter implying that = 0. 

Since y(x,t) = Y(x) cos cut, the boundary conditions become 

d 2 y 

Y=0 and ^§- = 0 

From equation (6.24), 

Y(x) = Q sin Xx + C 2 cos Xx + C 3 sinh Xx + C 4 cosh Xx 
= —X 2 C, sin Xx — X 2 C 2 cos Xx + X 2 C 3 sinh Xx + X 2 C 4 cosh Xx 

Note that sinh 0 = cosh 0 and 4|rcosh 0 = sinh 0 
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Hence, at x = 0 

Y(0) = Q X 0 + C 2 X 1 + C 3 X 0 + C 4 X 1 = 0 
( c[?) _ 0 = Q x 0 “ X 2 Q X 1 + X 2 C 3 X 0 + A 2 C 4 X 1 = 0 

Y(L) = C, sin XL + C 2 cos XL + C 3 sinh XL + C 4 cosh XL = 0 
= — A. 2 C 4 sin XL — X 2 C 2 cos XL + X 2 C 3 sinh XL + X 2 C 4 cosh XL = 0 

Step 2 

Assembling the four equations in matrix form gives: 


0 

1 

0 

1 

Ci 


°] 

0 

-x 2 

0 

X 2 

C 2 


°[ 

sin XL 

cos XL 

sinh XL 

cosh XL 

Q 


of 

— X 2 sin XL 

— X 2 cos XL 

X 2 sinh XL 

X 2 cosh XL 

c 4 


oj 


Step 3 

This is the particular form of equation (6.25) for a simply supported beam. Expanding the 
determinant of the coefficient matrix and equating to zero gives the frequency equation, 
so-called because the wavenumber is directly related to the frequency via equation (6.23). 

— 4 X 4 sin XL sinh XL = 0 

To identify the roots of the frequency equation, we note that a simply supported beam cannot 
deflect without bending. In other words, it has no freedom to move as a rigid body and will 
not, therefore, have a rigid-body mode of vibration. From equation (6.23), it follows that 
X = 0 will not be a root of the frequency equation. Not only is X # 0, but sinh XL # 0. 

The frequency equation therefore reduces to 

sin XL = 0 


which has roots X r L = r tt for r = 1, 2, 3, ... 

From equation (6.23), the natural frequencies are for r = 1,2, 3, ... 

Step 4 

To find the corresponding mode shapes, substitute the roots into equation (6.26) and solve for 
the constants Q — C 4 . 

From (6.26a), C 2 + C 4 = 0 

From (6.26b), X 2 (-C 2 + C 4 ) = 0 

Since X r # 0, it follows that C 2 = C 4 = 0 

From (6.26c), sin X,L • G, + sinh X r L ■ C 3 = 0 

Since sinX r L = 0 (the frequency equation) and sinh X r L # 0, C 3 = 0. 

The only non-zero constant is Q. Its value is arbitrary and we normally choose C 4 = 1. From 
equation (6.24), the mode shape expression becomes: 

Y r (x) = sin \ r x = sin-^r^, with r = 1, 2, 3, ... 

The first three mode shapes are as follows (there are animated versions on the book’s 
supporting website): 
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Example 2: Vibration of a cantilever (clamped-free) beam 

■ x = 0 x=L 


Consider a cantilever that is clamped at x = 0 and free a 

Step 1 

The boundary conditions are: 

dv 

At jc = 0, 


y = 0 and v— = 0 


At x = L, 


M= 0 


d^y 

dx 2 “ 


a 3 y _ 


dx 3 


and S = 0 

Since y(x, t) = Y(x) cos (at, the end conditions become: 
At x = 0, 


Y = 0 and 4^ = 0 
dx 


At x = L, 


d 2 _Y 

dx 2 


d 3 V 

= 0 and -^3 = 0 
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Step 2 

Substituting from equation (6.24) we get (in matrix form), 

r 0] (a) 

0 (b) 

0| (c) 

oj (d) 

This is the particular version of equation (6.25) for a cantilever beam. 


0 

X 

1 

0 

0 

X 

1 

0 

Q 

c 2 


— X 2 sir 

1 XL —X 2 cos XL 

X 2 sinh XL 

X 2 cosh XL 

C 3 


— X 3 sir 

1 XL —X 3 cos XL 

X 3 sinh XL 

X 3 cosh XL 

c 4 



(6.27) 


Step 3 

The frequency equation is given by setting the determinant of the coefficients of Cf — C 4 
to zero. After some manipulation (and noting that a cantilever has no rigid-body modes), this 
gives 


1 + cos XL cosh XL = 0 

There are no closed-form solutions to this equation, so the roots X r L must be obtained 
numerically and are given in Table 6.3 As before, the natural frequencies can be found using 
equation (6.23), which is the definition of the wavenumber, X. 

Step 4 

The mode shapes are obtained by substituting X = X r into equation (6.27) and solving for 
the constants Q — C 4 . 

From (6.27a) and (6.27b) C 3 = — C 4 and C 4 = — C 2 

Thus from (6.27c) or (6.27d) 


sinX r L + sinhX r L 
cos X r L + cosh X r L ^ 


= <T r Cj 


This gives C 2 , C 3 and C 4 in terms of Q , an arbitrary constant. 

If we choose Q = 1 , the mode shape becomes 

Y r (x) = sinX r x — sinhXrx + (7 r (cosX r x — coshXrx) 

When each value of X r is used in this equation, a different deflected shape is obtained. 
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Other boundary conditions 

Example: Cantilever beam with a mass at the free end 


= 0 




M of I I M 


Figure 6.37 


Step 1 

The boundary conditions at the clamped end are identical to the previous case. So, Y = 0 and 

f =0at* = 0. 
dx 

However, at x = L, S 0 and M 7^ 0. To look at the effect that the mass has on the vibration 
of the beam, we use two of the basic principles of mechanics. These are 

(1) compatibility of displacements 

(2) equilibrium of forces and moments. 

Consider first the shear force reaction between the beam and the mass. 


The principle of compatibility determines that the displacement at the end of the beam is the 
same as the displacement of the mass. 

The principle of equilibrium of forces determines that the shear force on the beam is equal 
and opposite to the force on the mass. 


The free-body diagram is: 


€ 



1 


1 

1 


x = L 

\y{L,t) S 


Figure 6.38 
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Note the positive directions for the displacement coordinate and the shear force on the beam 
defined by the sign convention adopted here. 

We can write separate statements relating to the beam and the mass, which can then be 
combined using the statement of equilibrium of forces. 


For the beam 

s « =Ei (S),. t 

But y{x,t) = Y(x) cos w 

= Ef (^?L = L COS “ 


For the mass 

S(t) = n 
S(t) = m X (— u> 2 Y(L) cos u>t) 


aM 

W/*-t 


Equating the shear force expressions and noting that a) 2 = ^ , 


( 0 ). 


pAL 4 


Y(L) = 0 


(6.28) 


Now consider the bending moment reaction between the beam and the mass. 

The principle of compatibility determines that the slope at the end of the beam is the same as 
the rotation of the mass. 


The principle of equilibrium of moments determines that the bending moment on the 
beam is equal and opposite to the bending moment on the mass. 

The free-body diagram is: 



Figure 6.: 


For the beam 



For the mass 

The rotational equation of motion for the 


/a 2 e\ 

Slope, 0(f) = = (4r) cos to 


Therefore |l=—oi 2 (g) ( 


Substituting into the equation of motion, 

M(f) = -/ M w 2 (^) x = i coso)f 

Equating the bending moment expressions, 

d 2 Y \ 4i(^) 4 /dY\ _ n 

d x 2 /x = l pAL 4 \d x/x = L 


(6.29) 
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Collecting the four boundary condition equations together, we have 


Y(0) = 0 (a) 


( d 3 Y\ m(XL ) 4 

(M^ + w Y(K) 


/ d 2 y \ _UM^(dY\ 

Id x z )x = L pAL 4 Id x)x = l 


0 

0 


(b) 

(c) 

(d) 


Step 2 

To set up equation (6.25) for this system, substitute for Y(x) and its derivatives from equation (6.24). 
Steps 3 and 4 then follow as in the previous examples. 


Other structures with distributed mass and stiffness 

The models presented in the earlier parts of this section are based on a range of assumptions 
that may or may not be acceptable in particular cases. An aircraft wing, for example, will 
exhibit bending modes of vibration, so the study of uniform beams gives some insight into the 
behaviour that can be expected. However, a wing does not have a uniform cross section, so 
the analysis will not be able to predict the natural frequencies of this structure accurately. 

For this and many other structures, engineers will invariably turn to the finite element method. 
There is an introduction to finite elements in Unit 3 related to obtaining displacements and 
stresses due to static loads. This involves creating and solving an equation of the form: 

[K]{X} = {P\ 

[X] is a global stiffness matrix formed by combining the individual stiffness matrices of the 
finite elements chosen to discretize the structure, and {P} is a vector of applied loads. {X} is 
the solution vector giving the displacements at the nodes. 

As well as solving static problems, most commercial finite element codes can also solve 
vibration problems by calculating a mass matrix in addition to the stiffness matrix. This can be 
used to set up and solve an eigenvalue problem in the form: 

([X] - io 2 [M]){X} = {0} 

This can be solved to give the natural frequencies and mode shapes for the structure. 


Learning summary 

By the end of this section you should have learnt: 

/ a systematic approach for setting up the equations of motion for single-degree-of-freedom and 
lumped mass-spring systems using the following three steps 
Step 1 convert the physical structure into a dynamic mass-spring model 
Step 2 draw free-body diagram(s) 

Step 3 apply the appropriate form(s) of Newton’s second law of motion to give the equation(s) of 
motion for the system; 

/ to obtain the natural frequency of single-degree-of-freedom systems from the coefficients in the 
equation of motion; 

/ the equations of motion for lumped mass-spring systems and how to obtain the natural 
frequencies and the corresponding mode shapes; 

/ to set up the boundary condition equations for shaft/beam vibration problems and obtain the 
frequency equation and an expression for the mode shapes. 
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6.3 Response of damped single- 
degree-of-freedom systems 

Introduction 

So far, we have looked at free vibration of systems and at their natural frequencies and mode 
shapes. This section will analyse the response of single-degree-of-freedom systems to external 
excitation. This takes the form either of applied forces and/ or moments or of imposed 
displacement on part of the system. 

We will also introduce the effects of damping. Damping is the phenomenon that dissipates 
energy in a structure. If there was no damping and a structure was set vibrating and then 
left, the mathematics would suggest that it would carry on vibrating forever. This, of course, 
is impossible and the structure would stop vibrating sooner or later. This is because all real 
structures dissipate energy to a greater or lesser extent. There are many ways this can take place 
and mechanisms include hysteresis effects in the material, friction between parts, aerodynamic 
interaction with the surrounding fluid and noise radiated from the surfaces. Most real damping 
mechanisms are difficult to handle mathematically and we will consider one theoretical 
damping model, called viscous damping and will only consider discrete dampers. These can 
be pictured as a piston— cylinder device in which a viscous fluid is displaced from one side of 
the piston to the other through a constriction such as an orifice. Vehicle shock absorbers are 
normally based on this idea. 

Fluid-filled Piston allowing flow 




Figure 6.40 

In the viscous damping model, we assume that the force in the damper is proportional to the 
relative velocity between the ends and acts to oppose the imposed motion. The constant of 
proportionality is called the damping coefficient (normally given the symbol c) and has units of 
N/(m/s) [orNs/m], 

Hence the force opposing the motion is c(x — y). 

Note that dampers do not impose any stiffness on the structure; they only transmit a force if 
there is relative motion between the ends. If there is no motion, there is no force. 

In most engineering structures, the level of damping is low. As a result, any discrepancies 
between the assumed viscous damping model and the actual damping mechanism are generally 
small, so that the error introduced by our mathematical model is also small. 

Equations of motion 

In the examples that follow, the steps leading to the equation of motion are the same as 
presented in Section 6.2 with the addition of any damping forces and any external excitation. 
We will then look at the solution to the equation of motion for a few types of excitation. 
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Example 1: Mass-spring-damper system 

Step 1 Dynamic mass— spring model 


Figure 6.41 

Step 2 Free-body diagram 


Figure 6.42 

Positive (downward) displacement of the mass from its equilibrium position will put the spring 
into compression, resulting in an upward force acting on the underside of the mass. Positive 
(downward) velocity of the mass will compress the damper. The damper will oppose this 
motion, resulting in an upward force acting on the underside of the mass. 

Step 3 Equation of motion 

p — kx — cx= mix 

or mix + at + kx = pit) (6.30) 




Example 2: Rocker system 

Step 1 Dynamic mass— spring model 



Figure 6.43 
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Step 2 Free-body diagram (for small displacements) 

This is similar to the rocker system given as an example in Section 
6.2, but with the addition of the damper and the applied force. 

For the damper, a positive (anti-clockwise) angular velocity will 
produce upward velocity of the mass and cause the damper to 
compress. The damper will oppose this motion, resulting in a 
downward force acting on the top of the mass. 

Step 3 Equation of motion 

Taking moments about the fixed pivot at A, the equation of motion 
is: 

-K^Q ■ L, - K 2 L 2 Q ■ L 2 - cL 2 Q L 2 + p ■ L 2 = / A 0 

Again modelling the rocker as a mass on the end of a massless bar, the moment of inertia of the 
rocker about the pivot is I A = mL 2 . 

Hence, on rearranging, 

mZ^ 2 0 + cL 2 2 0 + (FCjLj 2 + K r L2 2 )0 = p(t)L 2 (6.31) 



Example 3: Single-axle caravan 

We will make the following simpkfying assumptions: 

(1) The tyres are very stiff compared to the suspension springs (typically they are about ten 
times stiffer). 

(2) The tyres do not lose contact with the road. Taken with (1), this means that vertical 
motion of the axle will follow the road profile exactly. 

(3) The caravan body behaves as a rigid mass. 

(4) Only vertical motion of the body is considered; pitching and rolling are ignored. 

Step 1 Dynamic mass— spring model 


Body 


Springs and 


Axle 





Figure 6.45 

Step 2 Free-body diagram 

This is an example where the excitation is in the form of a prescribed displacement 
instead of a force. With the assumption that the tyres are rigid, the road profile over which 
the caravan travels exactly defines the motion (displacement and velocity) of the axle. Any 
difference between the relative displacement or velocity of body and axle will give rise to 
forces in the springs and dampers that will react onto the underside of the body. 

Recall that the free-body diagram is a snapshot of a system when all motions (both 
displacements and velocities) are positive. It shows the positive directions of the forces 
that the springs and dampers exert on the mass. The force in the spring is given by Spring 

force = Stiffness X Change of length. In this case, the change of length of the spring is the 2c(r-x) 2 k(r-x) 

difference between the body and axle displacements, which we can write as (r — x). This 
is positive whenever the axle displacement is greater than the body displacement. In this Figure 6.46 
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situation, the spring will be in compression and will produce an upward force on the body. 

For the damper, Damper force = Damping coefficient X Relative velocity. The relative velocity 
can be written as (r — x). With positive (upward) velocities of both axle and body, the 
expression for the relative velocity will be positive if the axle velocity is greater than the body 
velocity. In this situation, the damper will be compressing (i.e. getting shorter). The damper 
will resist this with the result that there will be an upward reaction force acting on the caravan 
body. 

Step 3 Equation of motion 


mx + 2 cx + 2kx = 2 cr(t) + 2 kr(t) 


(6.32) 


For any given vehicle speed, the shape of the road profile, r(t), can be measured as an 
explicit function of time, so that we know exactly what the displacement of the axle will be. 
Differentiating the displacement gives the axle velocity r(t), again as an explicit function of 
time. As a result, the excitation terms on the right-hand side of the equation of motion are 
completely defined. 

Collecting the equations of motion for the three examples, we have: 

Mass-spring-damper system mx + cx + kx = P(t) 

Rocker system mL^Q + cL 2 2 Q + {K X L X 2 + =L 2 P(t) 

Single-axle caravan mx + 2 cx + 2kx = 2c • f(t) + 2k- r(t) 

While each of the equations is different in detail, you will see that they all share a common 
mathematical form, that of a second-order ordinary differential equation with constant 
coefficients. All linear, single-degree-of-freedom systems have this form, which can be 
written genetically as: 

Mz + Cz + Kz = F(t) (6.33) 


in which 


z is the response coordinate 

M is the coefficient of z 

C is the coefficient of z 

K is the coefficient of z and 

F(t) is the excitation function; independent of z. 

The exact form of the three coefficients and of the excitation function will depend uniquely 
on the system being analysed. 

The subsequent mathematical manipulation required to solve the equation of motion depends 
on the nature of the excitation function and on the amount of damping in the system. 
Mathematics textbooks provide comprehensive coverage of such problems, and Unit 4 
presented the use of Laplace transforms for this purpose. In this unit, however, we consider 
three specific cases, namely ‘free’ vibration and harmonic (sinusoidal) and arbitrary periodic 
excitation. 


Free vibration 

If no externally applied forces or moments act on a structure, it can vibrate freely - hence the 
term ‘free’ vibration. We have previously used this situation to find the natural frequency of 
undamped systems. As mentioned above, the presence of damping means that any vibration 
will stop sooner or later, and consequently the term ‘transient response’ is often used in place 
of free vibration. 
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For F(t) = 0, we can use a solution in the form, z(t) = Ae kt 
Substituting into the equation of motion gives, 

M\ 2 Ae Xt + CXAe Xt + KAe kt = 0 

For a non-trivial solution, 

MX 2 + CX + K = 0 


so that 


V2 - 


-C± VC 2 - 4 KM 
2 M 


( 6 . 34 ) 


The complete solution is then 

z{t) = H ie V + A 2 e x 2 t ( 6 . 35 ) 

The integration constants, A 1 and A 2 , are found from the ‘initial conditions’ specified in the 
problem. These will normally be the displacement and velocity at t = 0. 

It can be seen from equation (6.34) that the roots X^ 2 can be either real or complex, 
depending on the amount of damping present. While equation (6.35) gives a mathematically 
correct description of the response, other forms relate more closely to engineering intuition 
and are thus easier to interpret. Four cases with different damping levels are considered. 


Zero damping 

For zero damping, the system will oscillate with a sinusoidal waveform, although this is not 
obvious from equation (6.35). 


Xi, 2 = : 


The term y is called the undamped natural frequency and is given the symbol 

Previously, we used the term ‘natural frequency’ for this. As we shall see later, a damped 
system will have a frequency at which it will vibrate freely, so the words ‘undamped’ or 
‘damped’ are used to distinguish between the two. 

Returning to the general case, equation (6.35) becomes 

z(t) = A 1 e“V + A 2 e~ m r>‘ 

This still doesn’t look much like a sinusoidal waveform. However, by using the complex 
number identities, 

e 10 = cos0 + isin0 and e -10 = cos0 — isin0 
and the fact that A x and A 2 are a complex conjugate pair, it can be shown that 

z(t) = B cos ay + C sin ay ( 6 . 36 ) 

The constants B and C are found from the initial conditions of the problem. 


High damping 

If the damping level is high so that C 2 > 4 KM, the two roots, Xj 2 > are both real and 
negative. The response, as given by equation (6.35), is the sum of two decaying exponential 
functions. The constants A 1 and A 2 are found from the initial conditions as usual. 
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Critical damping 

A special case for the roots of equation (6.34) occurs if C 2 = 4 KM. This value of damping is 
known as ‘critical damping’, which is thus given by 

C crit = 2\fKM (6.37) 

This is an important relationship and represents the boundary between two types of response. 
As we shall see in the next section, for subcritical damping the system will oscillate about 
its equilibrium position before coming to rest. For high damping, the free response decays 
exponentially back to its equilibrium position without oscillation. 


From equation (6.34), 



In this situation, in order to maintain distinct parts to the solution, the response is given by 


z(t) = A 1 e “» f + A 2 te 


(6.38) 


Note the Y in the second term, which is needed to retain two distinct terms in the solution. 


Light damping 

The vast majority of engineering structures possess damping levels much less than critical. 
From common experience, we know that a structure left to vibrate freely will come to rest 
eventually. The light damping case is thus the norm. Mathematically, this corresponds to the 
case where C 2 < 4 KM and the roots of equation (6.34) are a complex conjugate pair: 


c_ + . Akm - c 2 

Al ' 2 2 M - 1 2 M 


(6.39) 


It is convenient to introduce the damping ratio, y = — — — n • — = „ rrrr-; 

1 & ' critical damping 2 VKM 

Using also the expression for the undamped natural frequency, oi„, equation (6.39) becomes 



Substitution into equation (6.35) gives a mathematically correct (but not very convenient) 
solution. 

z(t) = A i e ( “ 7 “” + + A 2 e ( ~ 7 “” “ (6.41 ) 

Again, making use of the complex exponential identities and the fact that A 1 and A 2 are a 
complex conjugate pair, equation (6.41) can be rewritten as 

z{t) = e^w(B, cos oi„Vl - y 2 t + B 2 sin <aj\ - y 2 t) (6.42) 

Equation (6.42) matches our intuition since it describes a sinusoidal waveform (indicated by 
the terms in the brackets) with an exponentially decaying term at the front that will cause the 
amplitude of the sinusoid to decrease. Note that the constants and B 2 are both real. As an 
alternative, equation (6.42) can be rewritten as 

z{t) = C 0 e _7 “"'cos (utjl - i|i) (6.43) 

As with the previous cases, the two constants can be found from the initial conditions. 

Note that equations (6.42) and (6.43) both show that the frequency of vibration is 

u}„\j 1 — y 2 . This is known as the damped natural frequency and is less than the undamped 

natural frequency, oi M . 
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Worked example 

When at rest in equilibrium, the mass receives an impulse J of magnitude 5 Ns 
applied at time, t = 0. Find the response for t > 0. 



Figure 6.47 

Equation of motion 

— 2kx — cx= mix 

or m'x + cx + 2kx = 0 




Note that the impulse J doesn’t appear on the free-body diagram or in the equation of 
motion since it doesn’t exist for t > 0. 

From the coefficients in the equation of motion, we find that 
y = =0.1. Since the damping ratio is less than 1.0 in this problem, we can 

identify 

that it is a case of light damping. From equation (6.42), we can write the expression for 
the response as: 


x (t) = p-y *M(Bi cos fl„ t + B 2 sin O,, t) 
•y 2 = 9.9 rad/s. 

x = 0 at t = 0 .'. 1 

x(t) = B 2 e~y b, » t sin fl„ 


where Cl n = o)„y 1 — 

First initial condition: 

Hence, 

The second initial condition is the velocity immediately after the impulse, x 0 . This can 
be found from the change in momentum caused by the impulse. 


That is: 
Therefore, 


m(x 0 - 0) 

_ J 


Differentiating the expression for displacement gives 

x = B 2 [0, n e~y u ‘« t cos f 1„ t — 'YG) (1 e - ' Y “"‘sin fl„ t] 

i 0 = ~ at t = 0, .'. £ = - 0] 

Hence, B 2 = and x(t) = sin fl„ t 

Substituting the numerical values gives x(t) = 0.0505 e _t sin9.9i [m] 
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Estimating damping 

While the mass and stiffiiess of a structure can normally be calculated, the structural damping is 
very difficult to predict. However, equations (6.42) and (6.43) show that the rate of decay of 
the free vibration of a structure depends directly on the damping ratio, and this gives a method 
of measuring damping. 

In the above worked example, suppose we didn’t know the damping value, but had done an 
experiment to measure the transient displacement from the impulse. Here is the measured 
response waveform. 



We know that the expression for the displacement is x(t ) = e~ 7<a ' |t sin fl„ t 
The amplitude of the first peak at t = q is X x = e -7 * 0 "* 1 

The second peak occurs one period later and has an amplitude X 2 = e~ 7<d " (tl + T ") 

The ratio of the peaks is therefore = e' Y “» T » 


The period of the damped oscillation is T n = - 


~ — — for light damping. 


Taking logs of both sides of the expression for the ratio of the peaks, we get: 

ln (§) = 2ir Y 


In this case, = 0.0431 m and X 2 = 0.0229 m, so that y = 0.101 and c = 20.1 Ns/m, which 
compare well with the exact values of 0.1 and 20.0 respectively. 

Note that the ratio of any two successive peaks is a constant, so there is good scope for making 
several estimates of the damping ratio from a measured response waveform. 


Harmonic excitation 

Solution of the equation of motion 

Mz + Cz + Kz =f(t) 

The complete solution for z(t) consists of a particular integral and the solution to the 
complementary function. The latter is simply the solution to the equivalent free vibration 
problem considered earlier in this section. For non-zero damping, all these solutions decay to 
zero with time and they describe the transient response of the structure due to the sudden 
start of the excitation. In practice, the transient response normally decays quickly. As an 
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example, the graph below shows the complete solution for the response of a single-degree- 
of-freedom system due to the start of a cosine wave excitation. Near the start, we see the 
decaying transient response superimposed on the particular integral solution. 



After the transient response has decayed, we are left with the particular integral, which 
continues for as long as the excitation remains. It’s normally called the steady-state response 
and, in most cases, this is all we are interested in. 

Method 1 Direct Substitution 

Consider harmonic excitation of the form ,f(t) = F cos (lit 

For pure sinusoidal excitation, the response is also sinusoidal. The response has the same 
frequency as the excitation, but the two are likely to have a phase difference. The key 
information we want from the analysis is the amplitude of the response and its phase 
relative to the excitation. A suitable expression for the response is therefore 

z(t) = Zcos (bit + a) 

where Z is the amplitude of the vibration and a is the phase angle. 

To find Z and a, we substitute for z(t) and its derivatives in the equation of motion, expand 
the various trigonometric terms and equate the coefficients of cos bit and sin bit. This gives 

Z .:(K - Mo 2 } 2 + 0) Z C 2 

” d 

Note from equation (6.46) that the phase angle a. is negative, indicating that the response of a 
system lags the excitation. 
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Since the steady-state response to sinusoidal excitation is also sinusoidal and with the same 
frequency as the excitation, the key parameters to he identified are the amplitude and phase 
angle. Instead of drawing conventional time waveforms, a convenient way of showing the 
amplitude and phase relationship between excitation and response is with a phasor diagram 
(Figure 6.52). 

Method 2 Complex algebra 

This provides a mathematically convenient way of finding the amplitude and phase angle of 
the response. It has the added advantage that the same mathematical approach can be extended 
to more complicated (and therefore more realistic) structures, to more general forms of 
excitation and to experimental testing and digital data analysis procedures. 

The substitutions used are always the same; that is, 

Put /(f) = F e'“ ( (6.47) 

and z(t) = Ze ,(o) ' + “) 

= (Ze'“)e“' 

= Z*e ,w (6.48) 

where Z* is a complex number = Z( cos a + i sin a) 

Differentiating z(t), z = iu>Z*& wt and z = — w 2 Z*e 1 “ < 

When these are substituted into the equation of motion, we get 

(-Mat 2 + imC + K)Z* = Fe'“' 

Hence Z ' = IK M,Z) + intC 

or z* - K K - M “ 2 > - '“Cl 

or Z* = a + ib 


Z = F 

V(K - Ma) 2 ) 2 + (o 2 C 2 



Plotting Z* and F on the complex (Argand) plane (Figure 6.52), gives the same phasor 
diagram discussed before (Figure 6.53). Note that in practice, the imaginary part of Z* is 
always negative. As a result, a is also negative, meaning that the response lags behind the 
excitation (Figure 6.54). 
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Frequency response function 

To see how the excitation frequency affects the response, we will consider the frequency 
response function (FRF). By definition, this is the response/unit applied force. Note that 
it is used exclusively for force excitation. 

Start with the general form of the equation of motion, 

Mz + Cz + Kz =f(t) 

C 

Dividing by M and noting that ~rz = 2yco„, we get 


z + 2yo o n z + (o^z 


M 


With /(f) - Fe- and *(,) - Z*e- Z> - | j (a ,a _ ,2^,0, 

The FRF i, therefore: H(at) = f = (m 2 - ^ 

An alternative expression that emphasizes the frequency dependence is obtained by dividing 
top and bottom by and noting that = K gives 


The FRF magnitude is 


|H(ca)| = 


and the phase angle is 



(6.50) 


(6.51) 


It is clear from these expressions that the response of the structure (its amplitude and phase 
angle) depends on the ratio between the excitation frequency, «, and the undamped natural 
frequency, u>„ and on the damping ratio, y. 

Figure 6.55 shows how the FRF amplitude and phase angle vary with both frequency ratio 
and damping ratio. 
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Frequency ratio 



0 

0.02 

— 0.05 

— 0.1 
— 0.2 
— 0.5 


Figure 6.55 Amplitude (top) and phase angle (bottom) of FRF 
plotted against frequency ratio 


The graphs have been plotted for several values of damping ratio. Most structures have 
damping ratios of less than 0.1 and it can be seen that the resonant peak (the maximum 
response near the frequency ratio of 1.0) is large. Increasing damping will reduce the height 
of this resonant peak. In the phase angle plot, we see that for low values of damping ratio, 
the phase angle is close to 0° (i.e. the response is roughly in phase with the excitation) for 
frequencies below the undamped natural frequency and close to —180° (roughly out-of-phase) 
above the undamped natural frequency. 



Figure 6.56 Imaginary part of the FRF plotted against the real part (y = 001) 


Structural vibration 


The frequency giving the maximum response is called the resonant frequency. With low 
damping, the resonant frequency, the undamped natural frequency and the damped natural 
frequency are all virtually identical. For higher damping (look at the curve for y = 0.2), the 
resonant frequency is less than the undamped natural frequency. 

Each point on the real vs. imaginary plot shown above gives the phasor diagram we met before 
(the arrow illustrates one particular frequency). Although this is not shown explicitly, we move 
around the plot as the frequency ratio varies. Comparing this with Figure 6.56, when cu = 0, 
the amplitude is 1.0 and the phase angle is 0°. When to = the phase angle is —90° and the 
response is purely imaginary. As u tends to infinity, the amplitude tends to zero and the phase 
angle tends to —180°. 


Worked example - Use of the frequency response function 

Find the waveform of the steady-state vertical displacement of a reciprocating 
air compressor (mass, m = 4000 kg) that operates at a crank speed, ft, of 
300rev/min. The machine is supported on a set of four resilient mounts that 
have an overall vertical stiffness of2.5MN/m and a damping ratio of 0.04. 

The effect of the masses of the reciprocating pistons is to produce a vertical 
force on the compressor given by 

s(t) = 1.3 ft 2 cos ftt + 3.0 ft 2 cos 2ftt = cos oqt + S 2 cos a> 2 t 

The compressor can be modelled as a rigid mass and its mounts as a spring— damper 
combination as shown in Figure 6.57. Each sinusoidal term in the excitation will 
produce a steady-state response that is sinusoidal with the same frequency as the 
excitation. For example, the term .S) cos W] f will produce a response in the form 
Xj 005(0 )! t + a,). We can use the frequency response function to do this. 

i 

I 

J 


Once the response to each excitation term has been found, the total response is obtained 
simply by adding the two together. 

For each excitation term, we have X* = X S ; so that Xj = |X*| = H(oy') X Sj 

where j = 1 or 2. We can use the equations (6.50) and (6.51) to work out the FRF 
magnitude and phase angle values. 

Using the data given, 

S t = 1283 N, | ET (tOi) | = 6.81 X 10“ 4 mm/N, a! = -170°, Xj = 0.87mm 
S 2 = 2961 N, | H (a) 2 ) | = 7.52 X 10“ 5 mm/N, a 2 = -178°, X 2 = 0.22 mm 
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Time (s) 

Figure 6.59 Showing the total response by adding the two 
waveforms together 


The term ‘primary’ relates to the fact that the vibration frequency, w 1; is equal to the 
rotational frequency of the crank. The ‘secondary’ component has a frequency that is 
twice the rotational frequency of the crank. 


Periodic excitation 

Structures are rarely subjected to pure sinusoidal excitation in their operating environment 
and it may therefore seem strange that so much time is devoted to it in vibration textbooks. 

In many situations, however, periodic excitation is present, a simple example being the air 
compressor problem considered in the previous section. 

If we consider the system shown in Figure 6.60, the normal mathematical approach for solving 
the differential equation 

mx + cx+ kx = p(t) 

is to ‘guess’ an appropriate expression for the particular integral. This was successful in the case 
of sinusoidal excitation, but it can be difficult (or even impossible) for many other excitation 
functions. The method used here is an extension of the approach used for the air compressor 
problem. 
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For a periodic waveform that repeats exacdy every T seconds, Fourier series analysis or its 
digital equivalent, the discrete Fourier transform (DFT), can split it into a series of sine and 
cosine components. In this case, we will get 

p(t) = A 0 + £[4cos(iM + Bjsm(jh) 0 t)] 

1 2tt 

where the repetition frequency, o) 0 = -j, [Hz] = ~jT [rad/s] . 

The term in the square brackets represents a general sinusoidal wave with frequency jut 0 . The 
steady-state response to this general term will also be sinusoidal with the same frequency and 
we can calculate this using the frequency response function in much the same way that we did 
for the air compressor problem. If we do this for all of the terms in the series, we can then add 
all the individual responses together to give the overall response. 



Figure 6.60 


The process therefore is as follows: 


Step 1 Use Fourier series analysis to decompose the excitation waveform into a set of 
sinusoids. 

We can use the complex number representation of the waveforms to good effect here. 
For a typical pair of components we write 

Aj cos(j ut 0 t) + Bjsm(ju> () t) as P*e i - , “°* where P* = A- + iB ; 

(If the DFT is used, this complex number form is calculated directly.) 

Step 2 The response to this excitation can be calculated from the frequency response 
function, so that 

X* = H(ju > 0 ) X P* 

The response term, Xj , will have the form C ; + i Dj which can be interpreted as 
Cjcos(ju> 0 t) + DjSin(ju) 0 t) 

Step 3 Having done this for all frequencies in the excitation, all of the individual sinusoidal 
responses can be summed together. 


As an example, Figure 6.61 shows a periodic excitation in the form of a series of rectangular 
force pulses with a repetition period of 0.25 s (i.e. a repetition frequency of 4 Hz). The 
response of a single-degree-of-freedom system is shown in Figure 6.62. 



Time (s) 


Figure 6.61 The 4 Hz force pulse waveform 


Figure 6.63 gives the results of Step 1 and shows the Aj (top) and B,- (bottom) Fourier 
coefficients of the rectangular pulses plotted against a horizontal frequency axis. In addition to 
the A 0 at 0 Hz on the top graph, the components appear at 4 Hz, 8 Hz, 12 Hz, ... (i.e. the j (o 0 
terms from the Fourier series, which are at integer multiples of the repetition frequency) and 
some are positive, some negative. In this case, the coefficients up to j = 12 are shown. 
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Figure 6.62 Response of single-degree-of-freedom system 


[Note that since cos(0) = 1 and sin(O) = 0, the A 0 term can be seen as a special case of the 
general summation when j = 0.] 




Figure 6.63 Cosine (top) and sine (bottom) coefficients of the Fourier series 
expansion of the excitation waveform up to j = 12 


Figure 6.64 shows the frequency response function for the system in the example, with the real 
and imaginary parts plotted against frequency. The system has an undamped natural frequency 
of 25 Hz and a damping ratio of 0.2. 

In Step 2, we perform the complex multiplications between the FRF and excitation 
components. As an example, component 5 (at 20 Hz) gives the following data. 

P* 5 = -2.98 + i3.77 N 
H (20 Hz) = 0.0629 - i0.0559mm/N 
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g- Frequency (Hz) 

^ boo- 

's- -0.02 

-0.04 
o -0.04 
H -0.04 
| -0.04 
'3) -0.06 

Figure 6.64 Real and imaginary parts of the frequency response 
function, natural frequency = 25 Hz, damping ratio = 0.2 



These points are highlighted in Figures 6.63 and 6.64. After the appropriate complex 
multiplication we get 

P* 5 = -0.376 - i 0.045 mm 

Repeating this process by hand for a large number of components would be very laborious, 
but software (such as Matlab) that is able to do the necessary complex arithmetic using the 
Discrete Fourier Transform is widely available. 

Component 5 is highlighted in Figure 6.65, which shows all of the Cj (top) and Dj (bottom) 
terms plotted against frequency. These are the cosine and sine coefficients of the response 
waveform, which are added together in Step 3. The resulting waveform is shown in Figure 
6.62. 

Figure 6.66 shows how the response waveform builds up as more terms are added. Figure 
6.66(a) shows the constant term and the first three sinusoidal components (up to j = 5, or 
20 Hz) and Figure 6.66(b) includes up to j = 13 (52 Hz). The waveform does not change 
much once the j = 6 term (24 Hz) has been added. This is because the excitation components 
are smaller above this frequency (Figure 6.63) and the FRF magnitude is also reducing (Figure 
6.64). 

Notes 

(1) The above procedure will work for any periodic excitation. 

(2) The frequency response function used here is the same expression that was derived using 
pure sinusoidal excitation. We are thus able to use the result from a mathematically 
convenient excitation case (but one which is not often experienced in practice) to find the 
response to a more complicated (and practical) excitation. 

(3) Most engineering structures have more than one resonant frequency. Provided the 
frequency response function can be calculated (as explained in Section 6.4) or measured (as 
in Section 6.5), the same procedure can be used. 


421 


An Introduction to Mechanical Engineering: Part 2 






Figure 6.66 The summation process in Step 3: (a) terms up to j = 5; (b) terms up to j = 13 
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As an example, Figure 6.67(a) shows the magnitude plot of the frequency response function 
of a structure that has five resonant frequencies in the range from 0 to 350 Hz. To find the 
response of this system to the same 4 Hz pulse sequence, the 3-step procedure is repeated, but 
using the new frequency response function instead. The result is shown in Figure 6.67(h). 




Figure 6.67 (a) Magnitude plot of FRF of a five-degrees-of-freedom system and 
(b) its response to the 4 Hz force pulses 


Apart from some obvious differences in detail between this and the single-degree-of-freedom 
response waveform, the general shape and amplitude are similar for both. The main reason 
for this is that the original single-degree-of-freedom system is the same as the lowest natural 
frequency of the five-degrees-of-freedom system. The single-degree-of-freedom system is 
therefore a good approximation to the more complicated five-degrees-of-freedom structure. 
In Section 6.6 we will discuss how we can create single-degree-of-freedom approximations to 
more complicated structures. 

Overall, the method is very powerful as it can be applied to any structure subjected to any 
periodic excitation. 


Learning summary 

By the end of this section you should have learnt: 

/ to obtain the response of single-degree-of-freedom mass-spring-damper systems for the cases of 
‘free’ vibration and for harmonic (sinusoidal) and arbitrary periodic excitation. 
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6.4 Response of damped 

multi-degree-of-freedom systems 

Introduction 

Section 6.2 covered free, undamped vibration problems in the form, 

[M]{x} + [K]{x} = {0} 

These were solved with the substitution, { x(t) } = {X}cos cot, and led to the eigenvalue 
problem, 

([ K ] - a ) 2 [M]){X} = {0} (6.52) 

Solving this gave the natural frequency, o) r , and modal vectors, {X}, for each mode. 

Section 6.3 discussed the forced response of damped single-degree-of-freedom systems. 

This section extends this to damped systems with many degrees of freedom. The inclusion 
of damping and excitation adds extra forces to the free-body diagrams and produces a set of 
equations in the form, 

[M]{x} + {C]{x} + [X]{*} = {p{t)} (6.53) 

These equations are coupled, implying that they need to be solved simultaneously. It is, 
however, possible to uncouple the equations to give a set of individual second-order ordinary 
differential equations, each with only one response variable. These look just like single-degree- 
of-freedom equations and can be solved using the methods described in Section 6.3. 

The steps involved are as follows: 

Step 1 Uncouple the equations of motion to give a set of individual second-order ordinary 
differential equations 
Step 2 Solve each equation 

Step 3 Combine the individual solutions to give the actual response of the system. 

Orthogonality of modes 

A property of the modal vectors of the undamped system holds the key to uncoupling 
equation (6.53). 

For a typical mode, r, we can rearrange equation (6.52) to give 
[X]{X},.= o) r 2 [M]{X} f 

Premultiplying by the transpose of a different mode shape vector, {X} s , we get 
{X} S T [X]{X} r = <j) f 2 r {X}j T [M]{X} r 
Similarly, by starting with mode s, we can get 

{X} r T [K\{X} s = co s 2 {X} r T [M]{X} s 
Since [X] and [M] are symmetric, it follows that 

{X}/ [K]{X} S = {X} r T [K]{X} r and {X}/ [M]{X} 5 = {X}/ [M]{X} 
Subtracting (6.55) from (6.54) gives 

0 = K 2 -^){X} r T [M]{X} s 
Thus, provided that o r # to s , {X } f 7 [M]{X} ( = 0 

from which it follows that { X } r 7 [X]{X} S = 0 


(6.54) 

(6.55) 

(6.56) 

(6.57) 
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For r = s, {X} r T [M]{X} r = M r 

which is called the modal mass (a scalar) for mode r 

and {X} r T [K]{X} r = K r ( 6 . 59 ) 

which is the modal stiffness for mode r. Together, these equations describe the property of 
orthogonality for different modes of the system and they show that each individual mode 
shape is unique and independent of the others. 


Modal matrix and modal scaling 

The so-called modal matrix, [<&], is obtained by assembling all of the modal vectors, column 
by column. That is, 

[d>] = [{x } 1 {x } 2 ... {x>, ... ] 


Thus, column r of the modal matrix is the modal vector for mode r, with each element giving 
the amplitude of the displacement in each coordinate for that mode. 





Written out long-hand, the modal matrix is: 



It is important to get a clear understanding of the notation being used here since, for example, 
X 13 is not the same as X 31 . In particular, note that 

the first suffix (i.e. the row number) = response coordinate number 

and the second suffix (i.e. the column number) = mode number. 

Thus, X 13 is the response in coordinate 1 for the third mode of vibration and X 31 is the 
response in coordinate 3 for the first mode of vibration. 

Using equations (6.56) to (6.59), it is easy to show that 


and 




M r 


[<Df[X][0] 


K r 


These are the modal mass and modal stiffness matrices and it will be seen that they are both 
diagonal matrices. It is these results that enable the equations of motion to be uncoupled. 

Modal vectors have an arbitrary scaling. In equation (6.11), for example, the mode shape 
vectors were found by setting the amplitude in one of the coordinates to 1.0. The particular 
scaling adopted also affects the numerical values of the modal masses and stiffnesses. As a result, 
these contain an arbitrary factor too. 
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It is common to choose to scale the vectors so that all of the modal masses are equal to unity. 
The benefit of this will become apparent later. Starting with an arbitrarily scaled vector 
{X}, and the resulting modal mass M r , the modal vector can be scaled (or normalized) to 
unit modal mass using the expression, 


{U} r 


VQr 


(6.60) 


There is an example of this later. 


One advantage of using this method of scaling is that, in addition to having all of the modal 
masses equal to unity, the modal stiffness values are numerically equal to the square of the 
relevant natural frequency, that is, M r =1.0 and K r = w r 2 for all modes. 


Forced response of an undamped multi-degree-of-freedom structure 

Before considering a damped multi-degree-of-freedom structure, we will consider the 
equivalent undamped problem. The coupled equations of motion are 

[M]{x} + [*]{*} = {p(t)} (6.61) 

For step 1 of the solution process, we uncouple the equations by introducing the substitution 

M = m{q] (6.62) 

where {q} is a vector of modal coordinates. 

Substituting into equation (6.61) and premultiplying by [OJ T gives 

m T mm{q} + = p] T { P m 

Using the orthogonality properties for modal vectors and assuming that they have been scaled 
to unit modal mass, we get 


im) + 


W r {<?} = [ £ f J ] 7 {p(0 ) = {/(f)} say- 


(6.63) 


Just as equation (6.61) describes the equations of motion in terms of the physical coordinates, 
equation (6.63) describes the equations of motion in terms of the modal coordinates. Equation 
(6.62) can be seen mathematically as a vector space transformation, taking the equations from 
physical space (described by the physical displacement coordinates in {x}) to modal space 
(described by the modal coordinates {q}). 

Because the two matrices on the left-hand side of equation (6.63) are both diagonal matrices, 
the individual equations are not coupled and each involves only one mode. The equation for a 
typical mode r has the form 

q r + <* r 2 q r =f r (t) (6.64) 

Note that if the vectors have not been scaled to unit modal mass, the substitution of equation 
(6.62) into (6.61) still results in uncoupled equations, but each then has the form 

M r q r + K r q r = f r (t) 

In either case, there will be as many individual equations as there are modes of the structure, 
with each one involving only one of the modal coordinates. 

Each equation describes a single mode of vibration and will tell us how much that mode 
contributes to the overall response. This has the natural frequency ai r and is subjected to 
excitation defined by f r (t). Solutions for free vibration, harmonic and periodic excitation were 
described in Section 6.3, and solutions using Laplace transforms are presented in Unit 4. The 
fact that these are modal rather than physical space equations does not affect the mathematics. 
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The solutions to the modal space equations are the responses of each individual mode of 
vibration expressed in terms of the modal coordinates, q r (t). Finding the expressions for these 
responses is step 2 of the solution process. 


Once we have them, we can obtain the response in each of the physical coordinates by reusing 
equation (6.62), 


{x} = mu) 

This is step 3 of the solution process. 

Written out longhand, these equations are 

' u n u 12 ■■■ u 




The response in a typical coordinate, j, is therefore 

Xj{t) = ]T u jr q r (t) 

It can be seen that equation (6.65) states that the response of the structure at any instant is a 
weighted sum of the individual modal vectors. The modal coordinates, q r (t), define how much 
each mode shape contributes to the structure’s overall response at each instant. 


Worked example 

Find the steady-state response of each mass in the following two-degrees-of- 
freedom system to a sinusoidal force of amplitude 10 N and frequency 6 Hz 
applied to the 1 kg mass. 



Using the method presented in Section 6.2, the equations of motion, natural frequencies 
and modal vectors are obtained first. 

n o] f*,l r 2000 -10001 r*,i MO] 

[o 2J1* 2 J 1-1000 200oJW 1 0 J 

The eigenvalues are w, 2 = 634 s -2 and « 2 2 = 2366 s -2 giving natural frequencies of 
4.0 Hz and 7.7 Hz respectively. 
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The corresponding mode shape vectors are 


POi 



f 0.7321 
1 1.0 J 


and 


{X} 2 



in 


These vectors are scaled by making X 2 = 1.0 for each mode. Using equations (6.58) and 
(6.59), the modal mass and modal stiffness values are: 

M x = 2.54 ,M 2 = 9.46 ,K X = 1608 and K 2 = 22 392. 

Rescaling to unit modal mass using equation (6.58) gives the new modal matrix: 


rol = [0.460 -0.8881 
1 J 1 0.628 0.325 J 

Using the new modal matrix, the reader is left to confirm that [ < t ) ] 7 [A4] [ ( b] = |q ^ j 

and that [0] T [X][<I>] = f ^ 1. As mentioned earlier, the new modal stiffnesses of 

L 0 2366 J 

fCj = 634 and K 2 = 2366 are numerically equal to the squares of the corresponding 
natural frequencies. 

From {/(f)} = [ c b] 7 {p(f) }, the modal forces are 

[ 0.460 0.628] [Pi(0] = ( 0.460 X Pl (t) j 

l/ 2 (f)J [-0.888 0.325 J1 0 J [-0.888 Xp/t)) 

The modal equation for mode 1 is 

ii + w i 2 3i =/i(0 

or q x + 634 3l = 0.460p!(f) 


For the steady-state response, put p,(f) = 1 0e'“' and q r (t) = Q{ e'“' into (6.66) with an 
excitation frequency of 6 Hz to give: 

Q '° 634 - Tx 2^ "-°- M584 

Similarly for mode 2, 

^ = 2366 f( 6 8 X 2^ °- a00940 

From equation (6.65), 

fx{e‘ w< | _ [0.460 —0.8881 f 

[x^eH L0.628 0.325 J[Q2e ,(0t J 

Hence, X* = 0.460 X q£ - 0.888 X Q 2 = 5.66 mm 

and X 2 = 0.628 X Q* + 0.325 X Q 2 = -6.72 mm 


Notice that X* is positive, showing that mass 1 moves in phase with the force applied to 
it, while the negative sign for X 2 shows that mass 2 moves 180° out of phase with the 
applied force. 

Repeating the exercise across a range of frequencies, the modal responses and the 
displacements of the two masses are as shown in Figures 6.69 and 6.70. 
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XI (mm) 

X2(mm) 

--“2 



We see from the modal responses that each produces an infinite response if the excitation 
frequency coincides with the relevant natural frequency (4.0 Hz and 7.7 Hz, in this case). At 
frequencies in the vicinity of each natural frequency, the relative displacements of the two 
masses will closely resemble the proportions given by the relevant mode shape. In the vicinity 
of o)i, for example, | Qi| | Q 2 I so the first mode will dominate the response. The chosen 
excitation frequency of 6 Hz is about midway between the two natural frequencies and we 
find that the two modes make similar contributions at this frequency. 

Looking at the displacements of the masses, it’s interesting to note that at about 5.5 Hz, mass 1 
has a displacement amplitude of zero. This is perhaps surprising when you consider that the force 
is applied to this mass. The ability to design a dynamic system that has zero displacement at the 
point of force application is the basis for devices called tuned vibration absorbers. These were 
used, for example, to help cure the resonant vibration problem on London’s Millennium Bridge. 

Forced response of a damped structure 

As with single-degree-of-ffeedom systems, the presence of damping complicates the analysis. 
The equations of motion are 

[M]{x} + [C]{*} + [K]{x} = {p(t)} 

In the undamped case, the equations could be uncoupled by transforming into modal space 
using the modal matrix [O], We can try the same approach here. 
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For step 1, put {x} = [cl 3 ] {q}, where [O] is the modal matrix of the undamped system. 
Substituting into the equation of motion and premultiplying by [*S >] T gives 

m T mm{q} + m r icm{q} + mien®] ^ = m r { P m 


Using the orthogonality conditions for modal vectors, and assuming that they have been scaled 
to unit modal mass, we get 


m{«} + m'lcmiq} + 


co? u } = m r {p(t)} = {fit)} 


In general, the modal damping matrix, [®] qU] [®], will not be a diagonal matrix and the 
equations will not uncouple. The exception is if [C] can be written in the form 

[C] = a x [M] + a 2 [K\ 

where a x and a 2 are constants. In this case, the equations will uncouple. 

Engineers commonly assume that [C] does have the above form (even when it is not strictly 
true), an assumption known as proportional damping. The assumption considerably 
simplifies the analysis and the justification is that, since the damping in most real structures 
is low, the error introduced is found to he acceptably small. If we make the assumption of 
proportional damping, the modal damping matrix is 

[0] T [C][0] = 2y r u> r 


where y r is the damping ratio for mode r. The modal space equation for mode r is 
q T + 2y r h i r q r + a> 2 q r = fit) 

Steps 2 and 3 of the solution are then the same as before. An example of this is used in the 
next section to find an expression for the frequency response function. 


Frequency response function 

The frequency response function for a multi-degree-of-freedom system can be calculated using 
a simple extension of the method used in Section 6.3 for a single-degree-of-ffeedom system. 
We will consider the case of sinusoidal excitation applied to coordinate k and find the response 
in coordinate j. 

Section 6.4 showed that the equations of motion can be uncoupled to give a set of equations 
describing the behaviour of each mode. For mode r, 

4r + 2 y f w r 5 r + u> r 2 q, =f r (f) 


The modal force vector, {f(t )}, is calculated from [ ( 1 3 ] r {p(t)}. In this case, only the feth 
coordinate has excitation; all the other elements in the vector {p(t)} are zero. 


0 


m) 


0 

Pk(t) 

0 


0 
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From {/(f)} = [<l)] 7 {p(f)}, we find that /(f) = u kr p k (t). 

For harmonic excitation, put/(f) = Mj, r Pj,e la)( and q r (t) = Q r e ia>t into the modal equation of 
motion. 

* UyPk 

(to 2 — to 2 ) + i2 y r u> r u> 

u kr 

or ^ r ® (to 2 — to 2 ) + i2y r co r io Pj,e lmf 

Combining the individual modal responses gives the response in coordinate j to be: 


Xj (t) = X*e ia>t = 0 


j (to r 2 — w 2 ) + i2y r (D r co P fe e iwt 


The frequency response function giving the response in coordinate} due ti 
force in coordinate k is therefore: 


^ Ujrtikr 

H jk(v) - p k - 2_^(u) r 2 - to 2 ) + i 2y,to 


sinusoidal 


(6.67) 


Learning summary ^ 

By the end of this section you should have learnt: 

/ to uncouple the equations of motion of a proportionally damped multi-degree-of-freedom structure 
by making use of the orthogonality properties of the modal matrix; 
y to formulate response expressions, including the frequency response function. 

V y 


6.5 Experimental modal analysis 

Introduction 

Methods for predicting the natural frequencies and mode shapes of different types of structure 
were discussed in Section 6.2. In each case, approximations were required to derive the model. 
Moreover, none of the models was able to predict the damping level in the structure. 
Experimental modal analysis tests a structure as it is, without the need for modelling 
approximations. For each mode of vibration within a chosen frequency range, it identifies the 
natural frequency, damping and mode shape. These are called the modal parameters. 

As an example, Figure 6.71 illustrates the first bending and first torsional mode shapes obtained 
from a test on a free— free beam. There are animated versions of these mode shapes on the 
book’s supporting website. While many structures (such as helicopter rotor blades and lighting 
columns) have beam-like characteristics, few meet the idealized conditions incorporated into 
these theoretical models. 

In this particular test, each mode shape has been defined in terms of the amplitude and relative 
phase of the vertical motion at 18 points on the beam (9 along each edge). Each point vibrates 
sinusoidally about the undisplaced (equilibrium) position, shown by the dotted lines in the 
figure. Figure 6.71 itself shows ‘snapshots’ of the beam displaced when it is at one extreme of 
this motion. As seen previously, some points on the beam (indicated by the chain-dashed lines) 
are seen to be stationary. These are called nodal lines and are unique to each mode shape. 

A common way of performing the tests is to measure a set of frequency response functions and 
then find the modal parameters using a curve fitting procedure. 
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First bending mode 



First torsional mode 

Figure 6.71 


Frequency response function testing 

The frequency response function (FRF) defines the response per unit applied force across a 
range of frequencies. This was discussed in Section 6.3 where the FRF for a single-degree- 
of-freedom system was derived. A sinusoidal force input was used and an expression for the 
response displacement (expressed in terms of the steady-state amplitude and phase) was found. 
The FRF for an excitation frequency oo was shown (equation (6.49)) to be: 

H(co) — 2 J , (6.68) 

w F M (w„ — or) + i2yto„w 

where Z* is a complex number containing amplitude and phase information about the 
response, F is the force amplitude, M is the coefficient of acceleration in the equation of 
motion and w is the excitation frequency. The expression also contains two of the modal 
parameters; w„, which is the undamped natural frequency and y, the damping ratio. 

The amplitude and phase spectra are shown in Figure 6.72, plotted against the frequency ratio, 
w/a)„. These were discussed in detail in Section 6.3. 



Frequency ratio 

Figure 6.72 


Frequency ratio 
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If a sinusoidal force was applied to a structure and the steady-state amplitude and phase of the 
response was measured, one point on each graph would be found. The complete FRF plot 
could be measured by repeating this test at many different excitation frequencies. This is called 
a swept-sine test. 



Figure 6.73 


For a single-degree-of-freedom system, there is one resonant peak. For a multi-degree-of- 
freedom system, there would he several resonant peaks, but the swept-sine test would still 
work. 

Figure 6.73 is an example of an FRF of a multi-degree-of-freedom system. This one has five 
resonant peaks, indicating five modes of vibration in the frequency range. Note that each 
peak (such as the one highlighted in the box) has a related phase shift of 180°. This is the same 
characteristic seen in the FRF for the single-degree-of-freedom system in Figure 6.72. 

Since a single-degree-of-freedom system has only one coordinate to describe it, the response 
in that coordinate tells us all we need to know. For a multi-degree-of-freedom system, the 
displacement varies from point to point on the structure. At each resonance, the relative 
amplitude and phase of the different points is called the mode shape and is different from one 
resonance to the next. 

Section 6.4 showed - in equation (6.67) — that the FRF for a multi-degree-of-freedom system 
giving the response in coordinate j due to a unit sinusoidal force in coordinate k is: 

«*<“> - f - E - JT+ i2w» |6 - 691 

where the summation is taken over all of the modes, r, of the structure, and (O r and y r are the 
undamped natural frequency and damping ratio for mode number r. Uj r and u kr are the mode 
shape values in coordinates j and k for mode r. 

This can be rewritten as: 

«*<“> = £ H - g) + i2 w o, I 6 - 70 ' 

The terms = Uj r u kr are called ‘residues’ and are simply the product of the mode shape 
values of the structure at the response and excitation positions for each mode. 
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You can see that each of the five modes - i.e. each term in the summation in equation (6.70) 

- is described by an expression like equation (6.68). In particular, if there was only one mode, 
so that the summation in equation (6.70) had only one term, and the residue was equal to 
then it would be identical to equation (6.68). When deriving equation (6.69), the mode shapes 
were scaled to give modal masses equal to 1.0 and this is why there is no term. 

Measuring FRFs lies at the heart of experimental modal analysis. In principle, this can be done 
by applying a known force to the structure at point k, measuring its response at point j and 
then dividing one by the other. That is: 

Hpfa) = (6.71) 

An ‘impact test’ is a simple and effective technique for small, lightly damped structures. In 
a typical set-up for such a test, the structure is excited by a tap from a special instrumented 
hammer containing a force transducer. An accelerometer attached to a point on the structure 
measures the response. A set-up for a test on a free— free beam is shown in Figure 6.74. 



Figure 6.74 (Top) Beam supported in a cradle using rubber 
bands to simulate a free-free condition An instrumented 
hammer is next to the beam; (Bottom) Accelerometer 
measuring the vertical response at one corner (Point #1) 


The impact excites a wide range of frequencies simultaneously and the resulting transient 
response is picked up by the accelerometer. Typical force and response signals are shown in 
Figure 6.75. 

The tap from the hammer excites many modes of the structure simultaneously. Each mode 
behaves like a separate single-degree-of-ffeedom system and will give a transient response 
consisting of a decaying sinusoidal vibration at that mode’s natural frequency. The overall 


434 




Structural vibration 


z. l til lanlMl*! (■■t 


+0 -I 

3D \ 

f > 






iJ 




















l 

I ™ 


Figure 6.75 Typical force and response signals 


response is the combination of the individual decaying oscillations from all of the modes in 
the range. Provided that the structure behaves as a linear system, the principle of superposition 
applies, and the individual modal responses will not interact with one another. 

The force and response are both time domain signals, but they are clearly not sine waves as 
used in the theoretical derivation of the FRF. Once digitized, the force and response signals 
can, however, be analysed using the discrete Fourier transform to identify the individual 
frequency components they comprise. For each pair of frequency components, dividing 
response by force gives the FRF at that frequency. 


co) 

H ^ = F^) (6 ' 72) 

Doing this for all of the frequencies gives the complete FRF. An example was shown in 
Figure 6.73. 

Mode identification 

It is apparent from equation (6.70) that the frequency response function embodies the modal 
properties of the system (natural frequency, damping and mode shape). It follows that if 
you can find values of (/R) r , co,, and y r for each mode that make equation (6.70) match the 
measured FRF, you have a means of identifying the modal parameters. The matching can be 
done by curve fitting an analytical fit function such as equation (6.70) to measured FRFs. This 
is the basis for many commercial parameter identification programs. 
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In the earlier example of a free-free beam, the vertical coordinate at 18 measurement points 
were selected. If we used every combination of force and response coordinate, it would be 
possible to measure a total of 324 (i.e. 18 X 18) separate FRFs. These tests could be put into 
matrix form as: 



In this matrix, H\ 2 is the response in coordinate 1 due to a force applied in coordinate 2, etc. 
Fortunately, it’s only necessary to measure one row or one column of the matrix of FRFs. If 
we were using an impact test, we might place the accelerometer to measure the response at 
coordinate 1 due to forces applied in turn at coordinates 1 to 18. Doing this would give us the 
top row of the matrix. 

Equation (6.70) shows that each FRF can be computed if the residue, natural frequency and 
damping values are known for each mode. Conversely, the right-hand side of equation (6.70) 
can be thought of as a parametric fitting curve in which the modal parameters must be selected 
to give the best fit between the computed FRF and the measured one. General software 
packages such as Matlab can be programmed to do the fitting and there are several dedicated 
commercial programs on the market too. A variety of algorithms are used to do the curve 
fitting, but a description of these is outside the scope of this book. 

The frequency and damping of a mode are global properties; they are the same regardless of 
the measurement positions. However, the residue for each mode, (A fe ) f will be different for 
each of the FRFs. 

Once the curve fitting has been done for all of the FRFs, there will be residue values for each 
of the measurement coordinates and for each mode. In the beam example, if we had measured 
the top row of the FRF matrix, the curve fitting process would have given us the following 
vector of residues for each mode: 

Mt.i A u 2 ••• A us } r ( 6 . 74 ) 

Recalling that the residues are simply the products of mode shape values, equation (6.74) can 
be rewritten as: 


{(«l) r -(»l) r (u X ) r '{u 2 ) r ••• (Ml), ‘(Wig),} 

You can see that ( U\) r is a common factor in every term in this case. This value can be found 
provided that one of the measurements has the force applied at the same coordinate that the 
response is measured at. In the example being used, this is coordinate #1. Once you have the 
value of (H i , i ) r, the common factor can be found from equation (6.76). 

(%)/ («i)r = (A,l)r 


or («,), = 

Hence, the complete mode shape vector can be found from the corresponding residue vector 
using equation (6.75). 

(Ai f!) r 

( 6 . 77 ) 


Learning summary 

By the end of this section you should have learnt: 

/ how the expression for the frequency response function for a multi-degree-of-freedom structure 
can be used to identify the natural frequencies, damping and mode shapes of structures from 
experimental tests. 
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6.6 Approximate methods 

Introduction 

Often the lowest natural frequency of a structure is the most important. For example, if the 
first critical speed of a shaft is above its operating speed range, whirl is avoided. It is often the 
case that the first mode gives the largest displacement for a given excitation. 

This section presents two methods for estimating the lowest natural frequency and shows how 
single-degree-of-freedom approximations of more complex systems can be created. 


Dunkerley’s method 

Stanley Dunkerley developed a method of estimating the lowest critical speed of shafts. His 
formula can be used to estimate the lowest natural frequency of any system. 


co„ uq- ca 2 J 


(6.78) 


where a> r is the natural frequency of a subsystem that has only the rth mass present. The 
following examples show how the formula is used. 


Example 1 : Two-degree-of-freedom system 



Only mass #2 



2 


Figure 6.76 Two-degree-of-freedom system and its two subsystems 


The system on the left of Figure 6.78 has two masses, so there will be two subsystems, one 
with only mass 1 and the other with only mass 2. The two subsystems are also shown in 
Figure 6.76. 


Subsystem with only mass 1 When we remove mass 2 to create this subsystem, we also 
ignore the lower spring, since this is assumed to be massless. The natural frequency of the 
subsystem is given by: 
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Subsystem with only mass 2 Removing mass 1 leaves mass 2 supported by two springs 

k 

in series, each of stiffness k. These are equivalent to a single spring of stiffness — . The natural 
frequency of the subsystem is given by: 


k 



With two subsystems, Dunkerley’s formula will have two terms in the series on the right-hand 
side. Substituting the subsystem results into the formula, we get 


Hence, 



(6.79) 


As a numerical example, if m = 2 kg and k = 200 N/m, equation (6.79) gives the lowest 
natural frequency to be 0.918 Hz. The exact answer is 0.984 Hz. 


Example 2: Shaft with added masses 

The phenomenon of shaft whirl, and a method for calculating the 
whirl frequencies for plain shafts with uniform cross-sections, were 
presented in Section 6.2. While this could be a good approximation 
for the drive shaft on a vehicle, for example, it would not be 
appropriate for the steam turbine shaft shown in Figure 6.77. This 
carries several large-bladed discs that add significant mass to the 
system and affect its natural frequencies and hence its whirl speeds. 
Whirl can be avoided if the rotational frequency of the shaft remains 
below its lowest natural frequency in flexure and Dunkerley’s 
method is able to give an estimate of this. 

We will look at a simplified example consisting of a uniform shaft 
carrying three added masses. 

Figure 6.78 



Figure 6.77 Steam turbine shaft and discs 


1 


Subsystem with only mass 1 present is: 



Figure 6.79 


Since we are assuming that only mass 1 is present, we model the shaft 
Hence, the natural frequency for the system can be written as: 


“l 


( shaft stiffness at mass position, X] 
mi 


massless spring. 
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For a uniform, simply supported massless shaft, the stiffness at the position of mass 1 is 


t T 9 ohlL 

Hence, on = ?7 t \2 

1 m 1 x 1 z {L - x t y 

This can be repeated with mass 2 only, and with mass 3 only, to give o ) 2 2 and (O 3 2 . There is 
also a fourth system in which all three of the added masses are removed to leave only the mass 
of the shaft present. This is: 


A A 

Figure 6.80 


For a uniform, simply supported shaft, the first natural frequency is 


Combining all of the frequencies using Dunkerley’s formula gives: 


0)„ 2 OJ| 2 0) 2 2 W3 2 trig 2 


For all systems, Dunkerley’s formula always gives an underestimate of the true frequency. 
This is very useful, since it provides a conservative estimate for whirl calculations. Specifically, 
to avoid the possibility of whirl, the lowest critical speed (i.e. the lowest natural frequency) of 
the shaft must be greater than the top speed of the shaft. Therefore, if the top speed is below 
the value predicted by Dunkerley’s formula, it should also be below the actual critical speed. 


Rayleigh’s method 

Lord Rayleigh observed that for an undamped system vibrating freely at one of its natural 
frequencies, energy is conserved so that 

maximum kinetic energy = maximum strain energy 

This is the basis for his method. The strain and kinetic energies can be found for any structure, 
provided that we know the deflected shape (i.e. the mode shape). Since we do not normally 
know the exact mode shape, it is necessary to make an estimate of it. This is an important step 
since the accuracy depends on making a good guess. 

For systems with lumped mass and massless springs, the instantaneous kinetic energy for mass i is 

('T)mass i = 

For sinusoidal vibration, xfj) = X t sin a )t and x t (t) = mX, cos bit. Hence the maximum kinetic 
energy for mass i is 

(TmaxUss,- = = yw 2 m,X 2 

where Xy is the amplitude of vibration for mass i based on the assumed mode shape. 

The maximum strain energy in the spring 

(I/max) spring/ = y fey (maximum change of length ) 2 (6.81) 

The totals for the complete system are given by summing the contributions of all the masses 
and all the springs. Equating the strain and kinetic energies, and rearranging, gives an 
expression for a) 2 . The result can also be calculated from the mass and stiffness matrices for the 
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system. That is 

, _ {cl>} r [K]{4>} 

“ {c|>} r [M]{c|>} 

where {4>} is the assumed mode shape vector. 


Example 1 : Two-degree-of-freedom system 

From equation (6.78) the combined maximum kinetic energy for both masses is 
Tmax = jw 2 m 1 X 1 2 + j(D 2 m 2 X 2 2 
= j( 0 2 (m 1 X 1 2 + m 2 X 2 ) 

From equation (6.81) the combined maximum strain energy for the two springs is 


Umax = \hX 2 + \k 2 {X x - X 2 ) 2 


Equating T M ax ar >d Umax- we get 

, _ fet^t 2 + - X 2 f 

“» m x X x 2 + m 2 X 2 2 


(6.82) 


If we can estimate the mode shape, we will have values of Xj and X 2 that can be substituted 
into this equation. 

As a numerical example, take the case of m x = m 2 = 2 kg and = k 2 = 200 N/m. For 
the first mode of vibration, we would expect the two masses to vibrate in phase with each 
other and for X 2 > X x . Given that the springs have equal stiffness, we might guess that they 
both stretch by the same amount, which would suggest that X, = 1 and X 2 = 2 would he a 
reasonable estimate. 


Substituting these values into equation (6.82) gives a value for of 1.007 Hz. The exact 
answer for this problem is 0.984 Hz, so the error is 2.3%. 

The lowest mode shape often resembles the static deflection shape, so this is invariably a 
good ‘guess’. 

Here, noting that the bottom spring supports one mass, but the top spring carries the weight 
of both masses, the static extension of the top spring will be twice that of the lower spring. As 
a result, the static deflection shape is Xj = 2, and X 2 = 3. This gives co„ = 0.987 Hz, which is 
an error of only 0.4%. 

If we knew the exact mode shape, we would find that this gave the exact answer. 

Because Rayleigh’s method imposes a deflection shape on the system, it effectively constrains 
it to vibrate in a different way from the true mode shape. As a result, Rayleigh’s method will 
always give an overestimate of the natural frequency, unless you happen to guess the exact 
mode shape by good fortune. 

Thus, “Rayleigh “Exact 

The technique to adopt is to try several possible mode shapes. The lowest of the predicted 
frequencies will he the most accurate. 


Rayleigh’s method for shafts and beams 

Rayleigh’s method can also be applied to shafts and beams. In this case, the expressions for the 
maximum kinetic and strain energies are 

Tmax = ^“ 2 £ P A[Y(x)] 2 dx 

Umax = \ dx (6-84) 



X 2 

Figure 6.81 
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where Y(x) is the mode shape function, which defines the amplitude of vibration of the shaft/ 
beam along its length. Non-uniform cross sections (where A and I are functions of x) can be 
analysed, as can systems of interconnected beams, and systems that include discrete masses and 
springs. In each case, we sum the contribution of each element to the overall strain and kinetic 
energies. 

We need to guess Y(x) in order to evaluate the integrals. 


Example 1: Uniform cantilever beam 

The exact answer for this problem can be found by substituting the first 
equation given in Table 6.3 into equation (6.23). This gives: 


co„ = 


3.52 fW 
L 2 Vp A 


of the frequency 


The main criterion for choosing the mode shape is to ensure that it satisfies the displacement 
and slope conditions at the ends of the beam/ shaft. 


For a cantilever beam, Y = -p- = 0 at x = 0. The exact mode shape (see Section 6.2) is: 


sinXjL + sinhA.iL 

Ti(x) = sin Apt - sinhAi* - cosXlL + coshXlL («»M - cosh Ape) 

It would be impossible to ‘guess’ this function, particularly when we don’t know the natural 
frequency (which is needed to work out the wavenumber, Ai). 


Choice 1 Quadratic function Y(x) = Cx 2 

This expression satisfies the displacement and slope conditions at the clamped end and gives a 
shape that is similar to the actual mode shape. 

The maximum kinetic energy is given by 

W = ^ 2 J ( 'pA[Cx 2 ] 2 dx 

1 2 . v CL 5 
= 2 W_ P A X -g- 

, P ac 2 l 5 

= “ To 


The maximum strain energy 



| J EI(2C) 2 dx 
= 2 EIC 2 L 


Equating gives a 2 = 20—^4 

4.47 I El 

Prediction is = ~J 2 ~ V ^4 ’ w ^' c ^ i s significantly (27%) higher than the exact value. 


Choice 2 Static deflection shape Y(x) = cj^rj — 4 + 6^j J 
This expression gives w„ = an error of less than 1%. 
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Choice 3 A trigonometric function 

Suitable functions can often be found by inspection, avoiding the need to calculate the static 
deflection shape. For a cantilever beam, a suitable function would be: 

YM = l-cos(g) 

Using this, the prediction is oo„ = which is 4% high. 

Figure 6.82 compares each of the three approximate mode shapes with the exact expression. 
The quadratic function is the least accurate shape and gives the least accurate estimate of the 
natural frequency. Notice that there is greater bending in the quadratic function, resulting 
in an overestimate of the strain energy. Also the displacement amplitudes are less, resulting 
in an underestimate of the kinetic energy. Since we effectively divide the strain energy by 
the kinetic energy (albeit without the w 2 term), these two factors explain why the quadratic 
function gives such a high estimate. The static deflection shape is closest to the exact mode 
shape and this is why it gives the most accurate estimate of the natural frequency. The 
trigonometric function is a good compromise and can be found for different beams by 
inspection. 



cantilever beam 


Example 2: Beams and shafts with added masses 

A O B P 

Figure 6.83 

The added masses don’t change the strain energy, but they add extra kinetic energy. 

For mass at x = x p the maximum velocity is a) Y(x p ) 

Contribution of mass p to the kinetic energy is \m p [(j)Y(x p )] 2 

These contributions need to be added to the kinetic energy of the shaft itself. Hence, the total 
kinetic energy becomes 

T MAX = ^ 2 J o L pA[Y(x)] 2 dx+ J2 ^ 2 [Y(x p )] 2 

ALL MASSES 
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Single-degree-of-freedom dynamic models of complex systems 


The approach is based on the observation that if the total strain and kinetic energies of two different 
dynamic systems are identical, an exact analogue relationship will exist between the two systems. Such 
systems are called dynamically equivalent systems. 

In situations where the real system has many degrees of freedom, but where only one mode of 
vibration is of interest, the concept provides a powerful method for producing an approximate 
single-degree-of-freedom model to describe that mode of vibration. Not only are single- 
degree-of-freedom models easy to analyse, but examples in this section have confirmed that 
they can often give good predictions of the general behaviour of more complex systems. We 
will consider only undamped systems here. 

The approximate model consists of a simple mass-spring system, in which the displacement of 
the mass represents the displacement of some chosen point on the real structure. 

Using the concept of dynamically equivalent systems, the mass and spring stiffness of the 
approximate model are chosen so that the maximum strain and kinetic energies of real and 
model systems are the same. 


To do this, we assume that the lowest mode of vibration is dominant and therefore defines 
the deformation pattern in the structure. This is a good assumption if the system is vibrating 
sinusoidally near its lowest mode of vibration, but can also work well in other cases. An 
example of the latter was shown in Figure 6.67 where the response of a five-degrees-of- 
freedom system to a sequence of rectangular pulses was shown for comparison with that of an 
equivalent single-degree-of-freedom model. 

As with Rayleigh’s method, the accuracy of the model relies on having a good estimate of the 
real structure’s mode shape. 

Example 1 : Lumped mass system 

In this example, we will find an approximate single-degree-of-freedom model to analyse the 
motion of the top mass of the three-degree-of-freedom system shown in Figure 6.84. 

The first step is to establish a link between the displacement of the mass in the approximate 
single-degree-of-freedom model and some chosen point on the real system. In this case, the 
obvious choice is to link the displacement of the approximate model to the displacement of 
the top mass, x, since it’s the motion of this mass that we want to predict. 

However, since we are assuming that the motion of the real system is given by our chosen 
mode shape, not only does the approximate model predict the behaviour of the coordinate 
used as the link with the real system, but we can also work out the motion of the other 
coordinates since they are all linked by the assumed mode shape. In this example, it means that 
once we’ve found x(t) we can get y(t) and z(t), since each will be related to x(t) in proportion 
\X 

to the mode shape \ Y 

{ Z 

The approximate model is shown in Figure 6.85, with the motion coordinate chosen as x to 
match the coordinate of the top mass in the actual system. 

To find the values for the mass and stiffness in the model, we equate the maximum kinetic and 
strain energies in the real and approximate systems. 

Equating maximum kinetic energies in the real (LHS of the equation) and model systems 
(RHS of the equation), we get: 

j4(o>X) 2 + i2(wY) 2 + \2 (wZ) 2 = \m{ toX) 2 


Hence, 


4X 2 + 2Y 2 + 2Z 2 
X 2 


(6.85) 




Figure 6.85 
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Equating maximum strain energies in the real and model systems, we get: 


^200(X - Y) 2 + f 200(Y - Z) 2 + ^200 ■ Z 2 = \kX 2 


Hence, 


200 [(X - Y) 2 + (Y - Z) 2 + Z 2 ] 
X 2 


(6.86) 


To get values for m and k, we need an estimate for the mode shape. For the lowest mode 
shape, we expect all three masses in the real system to vibrate in phase with each other and, 
since the top mass is not connected to ground, that X > Y > Z. 


Choice 1 Since all springs have the same stiffness, we might guess that they all deflect by the 
same amount. This implies that the mode shape would be: 



Substituting these values into equations (6.83) and (6.84) gives, m = 5.11 kg and 
k = 66.7 N/m. 


Choice 2 The static deflection shape for this system is 


and substituting these values 


into equations (6.83) and (6.84) gives, m = 5.60 kg and k = 71.6 N/m. 


To decide which of these models is the more accurate, we can compare the natural frequencies 
predicted by each one. 



It can be seen that the second choice (the static deflection shape) is the more accurate since it 
gives the lower estimate of the natural frequency. 


Example 2: Forced response of a cantilever beam 


F cos 



Y(L, t) 


Figure 6.86 

In the second example, we will create a single-degree-of-freedom model of a uniform 
cantilever beam and use it to estimate the steady-state response at the free end due to a 
sinusoidal force having a frequency near the lowest natural frequency of the beam. 

There are two stages. First, we set up the approximate model and then we use it to do the 
steady-state response calculation. 

In this case, we choose to link the displacement of the mass in the approximate model to the 
displacement at the free end of the cantilever. In terms of the chosen displacement variables, 

z{t) = y(L, t) 

For steady-state, sinusoidal vibration, the link can be written as 

Z cos «f = Y(L) cos wt or Z = Y(L) 

To proceed, we need to choose an expression for Y(x), the amplitude of vibration for the 
cantilever. Two possibilities are considered. 



Figure 6.87 
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Choice 1 Y(x) = Gx 2 

Linking this with the approximate model, Z = Y(L) = CL 2 

Hence, C = so that Y(x) = ~^x 2 

This expression for Y(x) is then used to calculate the maximum kinetic and strain energies in 
the beam. By equating these to the equivalent expressions for the single-degree-of-freedom 
model, we get the required mass and stiffness values. 

1 nL 1 

For the kinetic energies, T MAX = — « 2 I pA[Y(x)] 2 dx = -xm(a 2 Z 2 

This gives the mass for the approximate model to be m = 0.2 pAL. Note that pAL is the mass 
of the beam. 

1 T L I d 2 Y\ 2 1 

For the strain energies, L' MAX = 2 J 0 ^ \ dx 2 / ~ 

El 

This gives the spring stiffness for the model to be k = 4 yy 

4 47 FeT 

Applying the natural frequency test to the approximate model, we find that «, = ^ 

which is the same (poor) result obtained with Rayleigh’s method using this choice for Y(x). 
Choice 2 Static deflection shape, Y(x) = c[(|() “ ^(l) + ^(l) ] 

Linking this expression at x = L with the coordinate for the approximate model, we get 
Z = Y(L) = C[l 4 - 4 X l 3 + 6 X l 2 ] = 3C 

Hence, C = f and Y(x) = f [(f ) 4 - 4(f) 3 + 6(f)']. 

Using this expression to calculate the mass and stiffness values for the model gives the 
following. 

m = 0.257 pAL and k = 3.2o|j 

The natural frequency test in this case gives oj, = > which is lower than the result 

from the first choice, confirming that the static deflection shape is the better approximation. 


Forced response analysis 

Having established the single-degree-of-freedom model, we can now use it to estimate the 
steady-state response at the free end of the beam due to a sinusoidal force. The equation of 
motion for the approximate model is: 

mz + kz = F(t) 

Making the standard substitutions, F(t) = Fe'“' and z(t) = Z*e 1M( , we get 

7 * = E 

(k - mco 2 ) 

Note that since there is no damping in this model, the expression for Z* is real. 

This meets the objective of finding the steady-state amplitude of the deflection at the free end 
of the cantilever. However, since we have assumed that the deflected shape of the beam can 
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be defined by the mode shape (that is, the function Y(x)), the expression for Z* can also tell us 
the vibration amplitude at any point along its length. In the case of choice 2, this gives: 




' 3(fc - 


Learning summary 

By the end of this section you should have learnt: 

y to use Dunkerley’s method and Rayleigh’s method to obtain estimates of the lowest natural 
frequency of structures; 

y to create a single-degree-of-freedom approximation of a more complex structure and use it to 
estimate the structure’s response. 


6.7 Vibration control techniques 


Introduction 


Vibration isolators (also known as anti-vibration mounts) are used for reducing the 
vibration transmitted from a source. They work by introducing flexibility between a 
device and its support. 


Case 1 In some cases, the source of vibration is within 
the device and the objective is to minimize the force 
transmitted to the support. Examples are a ship’s 
engine that can transmit vibration to the deck 
structure or a passing train that can produce 
ground-borne vibration. 

Case 2 In other cases, the source may be remote, but 
causes the support for a device to vibrate. Here, 
the objective is to minimize the displacement transmitted 
to the device. Examples are a satellite mounted in 
a launch vehicle or the need to protect sensitive 
laser instruments from ground-home vibration. 



launch 
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Figure 6.88 Satellite isolation 
system 


Structural vibration 


Types of isolator 

Elastomeric isolator 

This is the most common type of isolator. Elastomers can be moulded from many different 
combinations of many different materials, including natural rubber, neoprene, butyl and 
sibcone. A typical mount made with these materials generally employs the elastomer in shear 
but many utilize compressive strain also. The mounts may employ the elastomer in a manner 
that provides both shear and compressive loading for effective isolation performance in both 
the horizontal as well as the vertical direction. It is relatively easy to design various degrees 
of damping, shape, load— deflection characteristics and transmissibility characteristics into 
elastomeric isolators. The inherent damping of elastomers is useful in preventing problems at 
resonance that would be difficult to restrain if coil springs were used. 

For isolation from shocks, elastomers offer some significant advantages because of the fact that 
they can generally absorb more shock energy per unit weight than other forms of isolator 
system. 

Pneumatic vibration isolators 

Pneumatic isolators are air-filled, reinforced rubber bellows with mounting plates on top 
and bottom. Isolators such as these can provide very low natural frequencies with small static 
deflections. To provide a 1 Hz natural frequency, a steel coil spring isolator would need to 
be about 600 mm long and capable of deflecting about 250 mm. It would thus be difficult to 
install and would also present some lateral stability problems. Unlike most isolators, which 
are passive devices, pneumatic isolators are also used with position feedback in active control 
systems. This is used, for example, to maintain the height of a table for mounting optical 
equipment that is sensitive to the slightest movement. 



Figure 6.89 


Coil spring isolators 

Springs may be loaded in tension but it is frequently more convenient to load them in 
compression. Coil springs are used primarily for the isolation of low-frequency vibration. 
Consequently, they operate with a relatively large static deflection and lateral stability may he a 
problem. It can be shown that a coil spring will be stable if 

lateral stiffness > static deflection 
axial stiffness working height 
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Coil springs possess practically no damping, and the transmissibility at resonance is extremely 
high. This can be overcome by the addition of friction dampers in parallel with the load-carrying 
spring and these types of isolators are widely used. Another method of adding damping to a 
spring is by the use of an air chamber with an orifice for metering the airflow. For applications 
where all metal isolators are desired because of temperature extremes or other environmental 
factors, damping can be added to a load-carrying spring by the use of metal mesh inserts. 

High-frequency vibrations can be transmitted through the coils to the isolated unit. To overcome 
this, one or both ends of the spring can be fitted with elastomeric pads. Conventionally, the pad 
is attached to the bottom of the spring assembly, which has the added advantage of providing a 
non-slip surface that frequently eliminates the need to bolt the isolator to the floor. 


The isolators are invariably very much more flexible than the device they support, so the first 
approximation is to use a single-degree-of-freedom model in which the device to be isolated 
is treated as a rigid body and the isolators are represented by a spring-damper combination. 
The steady-state response to harmonic excitation provides a way of characterizing the isolation 
performance at different frequencies. 

Case 1 Source of vibration within a device transmitting vibration to the support 

We will assume that the device generates an excitation force, of amplitude P and frequency « 
and use the method from Section 6.2. 

Step 1 Dynamic mass— spring model 


Transmissibility analysis 


Figure 6.90 


/ 



Step 2 Free-body diagram 



k 


Figure 6.91 


Transmitted 
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Step 3 Equations of motion 

For the device: p — kx — cx= mix 

or mix + cx + kx = p (6.87) 

The transmitted force is q(t) = kx + cx (6.88) 

Substitute p(t) = Pe 1<at ,q(t) = Q*e lt0 ‘, and x(t) = X*e'“‘ into equations (6.87) and (6.88) to 
give 

X* = 71 — and Q* = (k + i c(a)X* 

(k — Old)') + ICO) v ’ 


EHminating X we get 


Q* _ (fe + ica>) 

P (k — mco 2 ) + icco 


Here, only the magnitude of the transmitted force is of interest and 

transmissibility as 


I Q* I I (fe 2 + cW) 

Tp “ |”P"| “ Y (k - mo) 2 ) 2 + c 2 o) 2 


define force 


(6.89) 


Case 2 Vibration from the support transmitted to the device 

The support vibration is defined by the displacement, y(t) = Y cos (at. 


Step 1 Dynamic mass— spring model 



Figure 6.92 


Step 2 Free-body diagram 

k(y - x ) c(y - x) 


\ \ 



Figure 6.93 
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Step 3 Equations of motion 

For the device: k(y — x) + c(y — x) — nix 

or mx + cx+ kx = cy + ky (6.90) 

Substitute y(t) = Ye I0W and x(t) = X*e IMt into equation (6.90) to give 

* _ (k + i«rt)Y 
X (k — mm 2 ) + icco 

Again, only the magnitude of the transmitted displacement is of interest and we can define 

displacement transmissibility as 


I (fe 2 + cW) ~ 

D I Y I Y (k - rmo 2 ) 2 + Ao 2 


(6.91) 


Note that the force and displacement transmissibility expressions for this mass— spring- 
damper system are identical. The same is true of other physical systems when analysing the 
displacements or forces applied to a given pair of coordinates (the mass and the support in this 
case). Note also that equations (6.89) and (6.91) only apply to this mass-spring— damper system. 
Other physical systems will have different transmissibility expressions. 


The reader is left to confirm that equations (6.89) or (6.91) can be rearranged by introducing 

f~k 

the expressions for the natural frequency and damping ratio for the system, namely o)„ = J — 


and y = 


2 \[hn' 


The alternative expression for the transmissibility r; 


^d.f 



(6.92) 


This form of the transmissibility expression emphasizes the importance of the ratio of the 
excitation frequency to the natural frequency in determining the effectiveness of the isolation 
system in reducing vibration. 



— Damping ratio 0 

— Damping ratio 0.2 

— Infinite damping 


Frequency ratio 

Figure 6.94 Transmissibility curves for a simple mass-spring-damper system 


The transmissibility graphs in Figure 6.94 are plotted for three different damping ratios. With 
zero damping, the transmissibility is infinite if the excitation frequency coincides with the 
natural frequency of the system. The main effect of increasing the damping ratio (y = 0.2 is 
shown in Figure 6.94) is to reduce the maximum transmissibility in the vicinity of the natural 
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frequency. At higher frequencies, the difference between the curves is small. Theoretically 
infinite damping would result in no relative movement across the damper, effectively giving a 
rigid connection between the device and its support. This, of course, is the original situation 
without isolators. 

It’s easy to show from equation (6.92) that T = 1 when m/w, = \[2 and that it is independent 
of the value of the damping ratio. This explains why the three curves intersect at this point on 
the graph. 

Notice that if «/ < \/2, the transmitted displacement (or force) is higher than the input. 

In this case, it would be better to have no isolators and fix the device rigidly to the support! 
However, if w/ u>„ > -[2, the transmissibility is less than 1.0, resulting in vibration reduction. 
The aim in selecting isolators is to ensure that the system operates in this isolation region. 

It is convenient to define an isolation efficiency to describe the reduction in displacement (or 
force) compared with the applied displacement (or force). This is: 


^ Reduction in displacement (or force) 

Isolation efficiency = — . , . . — ; — 7 ; — 

Original displacement (or force) 


X 100% = (1 - T) X 100% 


Isolator selection 

This section considers the task of selecting isolators to achieve a desired reduction in vibration. 
There are two constraints governing the selection: the lowest excitation frequency to be 
encountered, w MIN , and the maximum allowable transmissibility, I’max- 

The combination of Tmak and ^ IISJ is marked on Figure 6.95 and represents the worst 

case in respect of isolation efficiency, since for any excitation frequency greater than w„, the 
transmissibility will be lower than T MAX - It is apparent from Figure 6.95 that o)„ must be very 
much less than w MIN to ensure that T MAX will be less than 1.0 for all excitation frequencies 
greater than or equal to w MIN . 



Frequency ratio 
Figure 6.95 Design constraints for isolator selection 


Three variables affect the system’s dynamics; m, k and c, and m and k together determine w„. 
The stiffness, k, is given by the set of isolators selected. 

At first sight, the mass of the device may not seem to be a variable for this problem. However, 
m can be increased by mounting the machine on an inertia base. This will reduce a>„ and 

increase ^' N and have the beneficial effect of reducing the transmissibility. Figure 6.96 gives 
an example of an inertia base installation. 
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] h A 


Inertia base 



| | | | | 



Foundation 



Figure 6.96 Example of inertia base used to increase the isolated mass 


In most vibration situations, it is desirable to increase damping since this limits the amplifying 
effect of resonance. Here, since the system is designed to operate well above the resonant 
frequency, low damping is desirable since increasing y will increase the transmissibility in 
the isolation region. Low damping is easy to achieve and most commercial isolators give a 
damping ratio that is less than 0.1. 

It is normal to base the design on the assumption of zero damping and, as can be seen 
from the transmissibility graph in Figure 6.94, the error involved in the isolation region is 
small. 


It is also normal to treat each isolator independently of the others. In this case, m is the effective 
mass supported by the isolator in question. 


For the simple mass— spring model with zero damping, 



for the isolation region where « > (o n . 


Since = — , the required isolator stiffness is 


k = mu> n 2 


wTmax^min 2 

1 + r MAX 


(6.93) 


If selecting isolators from a manufacturer’s catalogue, it is unlikely that one with precisely this 
stiffness will be found. The stiffness given by equation (6.93) therefore gives the maximum 
value consistent with the design requirements. 

It might appear from this that any isolator would be suitable provided its stiffness was less than 
this value. This is not the case, however, since there are also constraints imposed by static 
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considerations. In the case of coil spring isolators, there could be installation, coil bottoming or 
lateral stability problems if the static deflection is too large. With elastomeric isolators, there are 
strength limitations under static load. 

Manufacturers often express these constraints by specifying a maximum static deflection. 
Therefore, after selecting an isolator to satisfy the maximum stiffness limit, it is necessary to 
check that the static deflection limit is not exceeded. 


The actual static deflection, X 0 , is given by 


X 0 


mg 

^isolator 


Alternatively, combining (6.93) and (6.94) gives 




g 

U MIN 


(6.94) 


(6.95) 


This represents the minimum static deflection consistent with the design requirements. 


These equations can be incorporated into a procedure for selecting suitable isolators for an 

application. The procedure is as follows: 

Step 1 Find the centre of mass of the machine. 

Step 2 Select the number and position of attachment points for isolators. 

Step 3 Estimate the load supported by each isolator. 

Step 4 For each isolator position in turn: 

(a) calculate the maximum stiffness from equation (6.93); 

(b) select an isolator with a lower stiffness; 

(c) check that this does not exceed any static deflection limit using equation (6.94); 

(d) although this will give a satisfactory selection, it is often worth repeating (b) and 
(c) with other isolators having even lower stiffness; the lower the stiffness, the 
greater the isolation efficiency, so the limiting factor becomes the maximum 
allowable static deflection. 


Tuned vibration absorbers 

Tuned vibration absorbers were mentioned in Section 6.1 as part of the solution to the 
resonant vibration on the Millennium Bridge. They were also mentioned in Section 6.4 where 
it was observed that it was possible to design a dynamic system that has zero displacement at 
the point of force application. This section explains how this can he achieved. 

Let’s look at the case of a structure that is subjected to a sinusoidal force whose frequency 
is very close to a natural frequency and consider the worst-case scenario that the structure 
has no damping to limit the resonant amplitude. We can model the mode in question as 
the undamped single-degree-of-freedom system shown in Figure 6.97(a). For the original 
system without the tuned vibration absorber, the response amplitude is given by equation 
(6.45) as 


v - 1 

X °~\k~ mco 2 I 

As a numerical example, we will take k = 1 MN/m, m = 1000 kg and Pj = 1 kN to get 
the response spectrum shown in Figure 6.97(b). As expected, the model predicts a very large 
response when the excitation frequency is close to the natural frequency. The aim of adding 
the tuned vibration absorber is to reduce this response to zero. 
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The tuned absorber itself is a secondary mass— spring system in which the ratio of stiffness to 
mass is identical to the original system. This is attached to the original mass as shown in Figure 
6.98(a). In the numerical example shown here, the ratio of the absorber mass to the original 
mass, |x, has been taken to be 0.01. 



This can be analysed in exactly the same way as the worked example in Section 6.4 to give 
expressions for the response of each mass. It is seen from Figure 6.98(b) that the response 
amplitude Xj is zero when the excitation frequency is equal to the original natural frequency. 
The effect of adding the tuned absorber is to create a two-degrees-of-freedom system whose 
natural frequencies He either side of the natural frequency of the original system. A limitation 
of this absorber is that it is only effective if the excitation frequency is constant and at a 
frequency very close to the natural frequency of the original system. If the excitation frequency 
deviates from this, there is a strong chance of exciting one of the new natural frequencies, 
since it can be seen from Figure 6.98(b) that these are only about 0.5 Hz (10%) away from the 
original natural frequency. The separation of the new natural frequency can be increased by 
choosing a larger value for the mass ratio, |x. However, there may be limits to the amount of 
mass that can be added, as for example in the case of the Millennium Bridge (Section 6.1). 
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This problem can be overcome by using a damped vibration absorber that uses a damper in 
parallel with the absorber spring, as in Figure 6.99(a). Note that the original structure remains 
undamped. The response spectrum with the damped absorber is shown in Figure 6.99(b) using 
a damping coefficient of 100 Ns/m. This has greatly reduced resonant peaks and still gives a 
very low response at the original natural frequency. 



Figure 6.99 


o 



The modal approach to solving the equations of motion cannot be used for this problem since 
they cannot be uncoupled. The equations of motion are: 

I” 7]{*} + [% rt ‘mId-W 

It will be seen that the damping matrix is not in the form [C] = a\[M\ + a 2 [K], meaning that 
we do not have proportional damping in this case. Since the equations of motion cannot be 
uncoupled, they must be solved as a simultaneous pair. For sinusoidal excitation, we can still 
make the usual substitutions, namely: p\(t) = //c"”', x\(t) = X*e l0W and x 2 (t) = X 2 e mt . This 
leads to the equations: 


(fe(l + p) - mw 2 ) + iwc -iLk-ux K IfXll fPll 

-pfe - i«c (pfe - pmw 2 ) + iwcj \x* 2 \ 1 0 J 

The responses are then given by 


jx*l f(fe(l + p) - mco 2 ) + iw 

c -pfe-iwc r 

[X 2 J [ — pfe — iwc 

(pfe — pmw 2 ) + iwc J 


In practice, this problem would be solved using software such as Madab, which has the 
capability to invert complex matrices. A Matlab script based on this example is included on the 
book’s supporting website. 

The most common application of tuned vibration absorbers is for overhead power lines. When 
wind blows across the cables, vortex shedding can induce vertical oscillations, which can cause 
fatigue in the cable. This is most prevalent where the cable is supported by the transmission 
pylons, and vibration absorbers are attached here to suppress the oscillations. 


455 



An Introduction to Mechanical Engineering: Part 2 



Figure 6.100 Tuned vibration absorber 
for overhead power lines (circled) 


The absorber consists of a pair of masses, each one on the end of a short cantilever made from 
cable. Bending of the cantilevers provides the required spring stiffness, and friction between 
the cable strands provides damping. 


Learning summary 

By the end of this section you should have learnt: 

S how to derive the transmissibility expression that gives the ratio of transmitted force to applied 
force (or of transmitted displacement to applied displacement) as a function of excitation 
frequency; 

S to select suitable isolators to achieve a given isolation efficiency; 

S how tuned vibration absorbers can suppress resonant vibration in situations where sinusoidal 
excitation coincides with a natural frequency of the structure. 
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Fluid Dynamics 

1 . Show that the two-dimensional velocity field 
given by u = ay and v = bx satisfies the continuity 
equation. Using the Navier-Stokes equations, derive 
the expression for the pressure p. 

2 . The three-dimensional velocity profile for an 
incompressible flow is given by 

u = x 3 + 2 z 2 and w = y 3 — 2 yz 
Derive a general form of the third velocity 
component, v from the 3-D continuity equation: 
du dt' dw _ _ 

dx dy dz ~ U ' 

3 . By neglecting the viscous and gravity terms, derive 
the Bernoulli equation from the steady Navier- 
Stokes equations. (Hint: choose the x-axis along the 
streamline.) 

4 . For an axial flow inside a circular tube, the Reynolds 
number which causes the transition to turbulence is 
approximately 2300, based on the tube diameter and 
average flow velocity. If the tube diameter is 4 cm 
and the fluid is petrol at 20°C, find the volumetric 
flow rate which causes the transition. The density and 
viscosity of the petrol are p = 680 kg/ m 3 and 

|x = 2.92 X 10 -4 kg/ ms, respectively. 


7 . A flow straightener is formed from perpendicular 
flat metal strips to give 25 mm square passages, 
which are 150 mm long. Water of kinematic 
viscosity 1.21 X 10 -6 m 2 /s approaches the 
straightener at 1.8 m/s. Neglecting the thickness 
of the metal, calculate the displacement thickness 
of the boundary layer and the velocity of the main 
stream at the outlet of the straightener. Using 

the Bernoulli equation, deduce the pressure drop 
through the straightener. 

8 . Atmospheric boundary layers are very thick but 
follow formulae very similar to those of the flat- 
plate boundary layer. Consider the wind blowing 
at 10 m/s at the height of 80 m above a smooth 
beach, which is considered to be the thickness of the 
atmospheric boundary layer. Estimate the wall-shear 
stress on the beach ground. What will be the wind 
velocity over your nose if: 

(a) you are standing up 1.7 m off the ground? 

(b) you are lying on the beach with your nose 
0.17 m above the ground? 

9 . An aircraft travels at a cruising speed of 250 m/s at 
10 km altitude. It has a smooth wing 7 m long and 
55 m wide, which can be approximated by a flat 
plate. Estimate the power required to overcome the 
friction drag of this wing. 


5 . Determine the shape factor of the laminar boundary 
layer when the velocity profile is given by: 


6 . A smooth flat plate 2.4 m long and 900 mm wide 
moves at 6 m/s through still air. Assuming that the 
boundary layer is laminar, calculate the boundary 
layer thickness at the end of the plate, the wall- 
shear stress halfway along, and the power required 
to move the plate. What power would be required 
if the boundary layer is ‘tripped’ to turbulent at the 
leading edge? 


10. A man weighing 800 N jumps from an aircraft 
flying at 1000 m altitude. The parachute s diameter is 
8 m and the time of descent is 2.9 minutes. Calculate 
the drag coefficient of the parachute, assuming that 
terminal velocity is attained within the first few 
seconds of the fall. 

1 1 . An electric transmission line consists of aluminium 
cable of density 2670 kg/m 3 with 15 mm diameter. 
The maximum cross-winds expected on the line are 
1 50 km/h. Calculate the magnitude and direction of 
the maximum loading on the line. 
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12. A four-cup wind speed indicator of 0.7 m diameter 
has a rusty centre bearing and the starting torque is 
very high. These cups are made of hemispherical 
shells of diameter 50 mm. Determine the starting 
torque of this instrument if a free stream velocity of 
4 m/s is required to initiate the rotation. You can 
ignore the connecting rods and side winds on the 
cups. 

13. A 60° cone with a 30 mm base diameter is mounted 
on a 200 mm long and 5 mm diameter rod, with its 
apex pointing to the free stream of 45 m/s. Estimate 
the total drag on this structure and the bending 
moment at the root of the rod. 

14. Icebergs can be driven at substantial speeds by the 
wind. Let the iceberg be modelled by a large flat 
cylinder, with its diameter D much greater than 
the height L. Only the top l/8th of the iceberg is 
exposed above the sea surface. Derive the expression 
for the steady iceberg speed V when it is driven by 
the wind of velocity U. 

15. A horizontal circular duct with a 0.3 m diameter, 

20 m in length draws air at atmospheric pressure 
into a centrifugal fan, which discharges the air into 
a rectangular duct 0.20 m X 0.25 m, 50 m long and 
to the atmospheric pressure. Show that the total 
pressure rise across the fan is 700 N/m 2 .The flow 
rate of the fan is 0.5 m 3 /s and the friction factor/ 
of the duct is 0.04.You should account for 

the minor losses at the inlet and outlet of the 
duct. 

16 . Kerosene of relative density 0.82 and kinematic 
viscosity 2.3 X 10 _6 m 2 /s is to be pumped through 
185 m of galvanised iron pipe ( e = 0.15 mm) at 40 
htres/s into a storage tank. The pressure at the inlet 
end of the pipe is 370 kPa and the liquid level in 
the storage tank is 20 m above that of the pump. 
Determine the size of pipe necessary, neglecting 
losses other than those due to pipe friction. 

1 7. The viscous sublayer is normally less than one 
per cent of the pipe diameter, therefore it is very 
difficult to probe it with a finite-sized instrument. 
To generate a thick viscous sublayer, one can use 
glycerine as a flow medium. Assume a smooth 
305 mm diameter pipe, 12.2 m long, with the bulk 
velocity of 18.3 m/s and the flow temperature of 
20 °C. Compute the sublayer thickness of the 
pipe flow and the power required if the pumping 
efficiency is 70 per cent. 


18. Ethyl alcohol at 20°K flows at V h = 3 m/s through 
0.1 m diameter drawn tube. Obtain: 

(a) the head loss per 100 m of tube; 

(b) the wall-shear stress; and 

(c) the local velocity at r = 20 mm. 

By what percentage is the head loss increased due to 
the surface roughness of the tube? 

19. A ship travels at a constant speed at sea surface. A 
propeller is required to propel this ship to maintain 
the steady motion, so that the thrust R from this 
propeller exactly matches the resistance of the 
ship. Using the TT-theorem, derive a functional 
relationship for R and other relevant physical 
quantities. 

20 . Instead of the ship in Question 19, consider a 
submarine cruising below sea surface. How would 
this modify your answer? Over the submarine 
surface there are a hatch cover, hand rails etc. How 
could these he considered in a dimensional analysis? 

21 . Derive a relationship for the power to rotate a disk 
at a constant speed in a homogeneous fluid as a 
function of relevant physical quantities. 

22 . The flow through a circular pipe may be metered 
from the speed of rotation of a propeller, having its 
axis along the pipe centreline. Derive a relationship 
between the volumetric flow rate and the rotational 
speed of the propeller in terms relevant physical 
quantities. 

23 . A propeller of 75 mm diameter, installed in a 

150 mm diameter pipe carrying water at 42.5 htres/s 
was found to rotate at 20.7 rps. If a geometrically 
similar propeller of 375 mm diameter rotates at 
10.9 rps in air through a pipe of 750 mm diameter, 
estimate the volumetric flow rate. 

24 . A torpedo-shaped object with a 900 mm diameter is 
to move in air at 60 m/s. Its drag is to be estimated 
from tests in water using a half-scale model. 
Determine the necessary speed of the model and the 
drag of the full-scale object if the measured drag of 
the model is 1140 N. 

25 . Tests on a 371.3 mm diameter centrifugal water pump 
at 2134 rpm yield the data given below. What is the 
Best efficiency point (BEP) and the specific speed of 


this pump? Estimate the 

maximum discharge. 


Qm 3 /s 

0 

.0566 

.113 

.170 

.227 

.283 

H m 

104 

104 

104 

101 

91.5 

67.1 

blip kW 

101 

120 

154 

191 

248 

248 
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Questions 


26 . A pump from the same family as in Questions 25 
is built with D = 457.2 mm and n = 1,500 rpm. 
Find the discharge, water pressure rise and the brake 
horsepower at BEP. 

27 . A large centrifugal pump with a non-dimensional 
specific speed of 0.183 revolutions is to discharge 
2 m 3 /s of liquid against a total head of 15 m.The 
kinematic viscosity of the liquid may vary between 
three and six times that of water. Determine the 
range of speeds and test heads for a one-quarter 
scale-model investigation to be carried out in water. 


Thermodynamics 

Refrigeration 

1 . (a) What pressure is required in a freezer 

compartment evaporator coil using refrigerant 
R134a if the temperature is to be maintained at 
-20°C? 

(b) What is the enthalpy and entropy when the 
R134a is: 

(i) all liquid? 

(ii) all vapour? 

(iii) What does the change in entropy between 
the two states tell you? 

(c) Assuming a mass flow rate of R134a of 20g/s, 
and making use of the SFEE, what heat transfer 
rate occurs in the evaporator if the refrigerant 
leaves at saturated gas condition (i.e. all vapour, 
with no superheat) and enters at saturated liquid 
condition (i.e. all liquid and no sub-cooling)? 

Air conditioning 

2 . An air conditioning unit draws in 0.5kg/s of 

atmospheric air at 30°C and specific humidity 0.012. 

(a) What is the mass flow rate of water vapour in 
the air? 

(b) Using the formula relating specific humidity 
to partial pressure of the water vapour and 
the atmospheric pressure, and assuming one 
atmosphere atmospheric pressure, what is the 
partial pressure of the water vapour? 

(c) Find the p g of water vapour. What is the relative 
humidity of the incoming air? 

(d) Find the dew point corresponding to the 
pressure p g . Sketch a T— v diagram showing the 
process that the water vapour in atmospheric air 
must go through in order to form dew. If the 
air is to be issued at 20°C and 50% humidity, 
determine the moisture removal rate, and the 
heat transferred in the cooler and heater sections. 


Vapour power cycles 

3 . A turbine receives steam at 550°C and 200 bar. 

(a) Assuming an isentropic expansion through the 
turbine to a pressure of 40 bar, what is the final 
temperature? 

(b) The turbine actually expands to 40 bar and 
330°C. What is the isentropic efficiency of that 
turbine? 

4 . A power station uses the Rankine cycle between a 
boiler with feedwater at 200 bar and a condenser 
which is at a pressure corresponding to the saturated 
water vapour at 30°C. 

(a) What are the pressure and specific enthalpy in 
the condenser? 

(b) What are the temperature and specific enthalpy 
of the saturated steam? 

(c) If the mass flow rate of the water is 100 kg/s, 
what is the heat input to the steam in the 
boiler? 

(d) What is the work done in the pump? 

(e) The Rankine cycle is modified with a superheat 
steam pipe circuit prior to entering the high 
pressure turbine. The cycle is also modified with 
reheat after exit from the high pressure turbine 
up to 550°C, before exhausting in the low 
pressure turbine at the pressure of the condenser. 
Given the isentropic efficiency of the low 
pressure turbine is 85%, what are: 

(i) specific enthalpy at turbine entry? 

(ii) specific enthalpy at turbine exit? 

(iii) the steam quality at exit? 

(f) Knowing the pump work, the work produced 
by the high pressure turbine and the low 
pressure turbine work, what is: 

(i) specific steam consumption? 

(ii) work ratio? 

Combustion chemistry 

5 . (a) Write the reaction equation for the 

stoichiometric combustion of butane in oxygen. 

(b) If the mass of butane consumed is 20 g, how 
many moles of butane is this? 

(c) For this 20 g of butane, what mass of 0 2 is 
consumed in the reaction? 

(d) How many moles of oxygen is this? 

(e) What are the masses of the gases in the 
products? 
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6 . (a) Write the stoichiometric reaction equation for 

butane in air. 

(b) What is the air to fuel ratio by volume and by 
mass? 

(c) What are the proportions by mass and by 
volume of the product gases? 

(d) Determine how much air is required for the 
stoichiometric combustion of propane burning 
in air. Determine the AFR by mass and by 
volume. 

(e) The propane is burned with 30% excess air. 
Determine the volume and mass fractions of 
reactants and products. 

Heat transfer 

7 . (a) What is the thermal resistance of a brick wall 

(k = 1.5W/mK), 10 cm thick, on a room wall of 
height 2.3 m and width 3.1m? 

(b) Calculate the thermal resistance of the wall given 
that it has three layers: brick, insulation, brick. 
Each layer is 10 cm thick, k = 1.2W/mK for the 
inner brick, 0.5W/mK for the insulation and 
2W/mK for the outer brick. 

(c) Given that the convective heat transfer to the 
wall’s inner surface is 9 W/m 2 K, and on the 
outer wall is 95 W/m 2 K, find the thermal 
resistance for the inside and outside convective 
conditions, and hence for the overall wall. 

(d) What is the heat transfer if the inner air is at 
18°C and the outer air is at 1°C? 

(e) Sketch the temperature profile through the wall. 

(f) The ground at night has air blowing over it at 
2°C, with a heat transfer coefficient of 
20W/m 2 K. It also sees the night sky, which is at 
— 55°C.The view factor is 1 and the emissivity 
of the ground is 0.9. Given the ground has 

a temperature of 10°C at a depth of 3m and 
conductivity of 2 W/m 2 , calculate the surface 
temperature of the ground. 

Combustion energy 

8 . Calculate the energy release by combusting lOg of 
butane (C 4 H 10 ) given the enthalpy of formation of 
liquid butane at standard conditions is — 147.6kJ/mol. 
Assume the products contain water as vapour. 

9 . Butane is burned by a stove with 100% excess air at 
25°C to heat up a pan of water. If the combusted 
gases escape at 800K, how much heat is extracted 
from the flame per kmol of fuel? 


10. What mass of butane is required to heat up 400g of 
water from 25°C to boiling point? 

Heat exchangers 

11 . (a) A heat exchanger operates with an oil with c p of 

1.67 kJ/kgK and density 910 kg/m 3 , and water 
with c p of 4.2 kJ/kgK and density 1000 kg/m 3 . 
The oil volume flow rate is 3158 1/h and the 
water flow rate is 2000 1/h. What is the capacity 
rate of the oil and of the water? 

(b) The water enters at 50°C and leaves at 70°C and 
the oil enters at 120°C and leaves at 85°C.The 
heat exchanger is of the shell and tube type with 
counter-current flow. What is the logarithmic 
mean temperature difference? 

(c) What is the surface area of the heat transfer 
surface within the heat exchanger if it 

has an overall heat transfer coefficient of 
1100W/m 2 K? If this is done with a tube having 
diameter 12 mm, what is the length of tube in 
the exchanger? 

12. A compact heat exchanger is constructed with 
a cross-flow matrix arrangement with unmixed 
streams for cooling oil in an air stream. The oil 
enters at 70°C and is required to leave at 30°C.The 
oil density is 700kg/ m 3 . The flow rate of oil is 5 
litres per minute and it has a specific heat capacity 

c = 1.7 kJ/kgK at 10°C and 2.5 kJ/kgK at 100°C. 

(a) What is a suitable average specific heat capacity 
of the oil over the working range? 

(b) What is the rate of heat transfer? 

13. A compact heat exchanger has oil exchanging 
heat with water and is used to cool engine oil. The 
oil mass flow rate is 50 g/s and c p = 1800 J/kgK. 
The water mass flow rate is 60 g/s. Given that the 
ingoing oil is at 130°C and the ingoing water is at 
15°C: 

(a) What are c mjn and c max for this situation? 

(b) What is the maximum potential heat transfer in 
this situation, q max ? 

(c) What is the NTU if the area is 0.5 m 2 and the 
overall heat transfer coefficient is 800 W/m 2 K? 

(d) What is the effectiveness, £, and hence the actual 
heat transfer? 

Compressors 

14 . A two-stage reciprocating air compressor, running 
at four cycles per second, delivers air at 18 bar from 
atmospheric air at 1.03 bar at ll°C.The dimensions 
of the first stage are 100 mm diameter and 100 mm 
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stroke, with a clearance length of 5 mm. Taking 
the polytropic index as 1 .25 for compression and 
expansion processes: 

(a) What is the swept volume and clearance volume 
of the first stage? 

(b) What is the intermediate pressure for minimum 
work? 

(c) What is the expansion of the clearance volume 
in the first stage? 

(d) What is the volumetric efficiency and hence the 
mass flow rate of air? 

Solid Mechanics 

1 . A helicopter rotor shaft, 50 mm in diameter, 
transmits a torque of 2.4 kNm and an upward tensile 
lifting force of 125 kN. Determine the maximum 
tensile stress, maximum compressive stress and the 
maximum shear stress in the shaft. 

2. For the purposes of analysis, a segment of a 
crankshaft in a vehicle is represented as shown 
in Figure Ql.The load is P = 1 kN, and the 
dimensions are iq = 80 mm, b 2 = 120 mm and 

= 40 mm. The diameter of the shaft is 
d = 20 mm. Determine the maximum tensile, 
compressive and shear stress at point A, located on 
the surface of the shaft at the z axis. 



3. Using the Tresca and von Mises yield criterion in 
turn, calculate the pressure to cause yielding in a 
steel cylinder, which has an 80 mm diameter and is 
1 mm thick. The cylinder is closed at each end; end 
effects should be neglected. Assume cr y = 250 MPa. 

4. What additional torque can be applied about the 
axis of the cylinder in question 3 if yielding is to 
occur with an internal pressure of 4.0 MPa. 


5. A steel beam of rectangular section 10 mm X 30 mm 
is subjected to pure bending in a plane parallel to the 
30 mm faces. Ideal elastic-plastic behaviour may be 
assumed. Calculate the bending moment necessary for: 

(a) the onset of yield; 

(b) the complete yield through the section. 

Assume a y = 250 MPa. 

6. The web and flanges of a straight I section steel 
beam are 80 mm wide and 10 mm thick. The beam 
is loaded in pure bending in the plane of the web 
until the whole of each flange has yielded but the 
whole of the web remains elastic. Calculate the 
residual curvature in the unloaded beam. Assume ideal 
elastic— plastic behaviour. Assume cr y = 250 MPa and 
E = 200 GPa. 

7. Figure Q7 shows a simply supported beam carrying 
two concentrated loads at the positions indicated. 
Given that the beam has a rectangular cross-section 
as shown, calculate the deflection of the beam 

at a position 3 m from the left-hand end and at 
a position 5 m from the right-hand end. 

(E steel = 200 GPa.) 

|P = 10kN|P = lOkN 



All dimensions in metres 

Figure Q7 

8. Find the slope at point A and the deflection at point 
B of the beam shown in Figure Q8. 

(El = 4 X 10 6 Nm 2 .) 

w = 10 kN/m 



Figure Q8 


9. A steel beam with both ends built in has an 

effective span of 7.5 m and carries a concentrated 
load of 60 kN at a point 4.5 m from the left-hand 
end. Calculate the position and magnitude of the 
maximum deflection due to this load, given that it 
occurs at a point between the left-hand end 
and the concentrated load. (E = 208 X 10 9 Nm 2 ; 
I = 85 X 10 6 mm 4 .) 

10. Find the critical load for a steel column of 

rectangular cross-section 50 X 25 mm and length 
3 m for each of the following assumptions: 

(a) the ends are hinged; 
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(b) the ends are built in; 

(c) one end is hinged and restrained from moving 
horizontally, the other end is clamped. 

11. A new tripod for a surveying instrument will 
consist of three equal tubular legs, hinged at the 
top and resting on points at the bottom. When the 
instrument is set up for use, the top hinge points are 
equally spaced on a 100 mm diameter horizontal 
circle, while the pointed ends are equally spaced 
on a 1 m diameter circle. The top hinge points are 
1.3 m above the level ground, on which the pointed 
ends rest. The greatest expected instrument weight is 
80 N. Assuming a factor of safety of 10 against elastic 
buckling, select the lightest safe aluminium tube 
(E = 70 GPa) from the following stock list. 


Thickness (mm) 

Outer diameters available (mm) | 

1 

5,10,15, 20, 25, 30 

2 

10,15, 20, 25, 30,40 

3 

15, 20, 25, 30, 40,50 


12. A hollow steel column, 9 m long and of circular 

cross-section, 90 mm outer diameter and 75 mm inner 
diameter is subjected to an end thrust of 30 kN; the 
line of action of the thrust is parallel to the unstrained 
line of the strut, but does not coincide with it. Under 
load the maximum deviation of the strut from the 
straight is 80 mm. Find the eccentricity of the load 
and the maximum compressive and tensile stresses. 
The ends may be assumed to be hinged. 

Assume E = 200 GPa 


13 . Figure Q13 shows the cross-scetion of a solid beam 
which carries a vertical shear force of 100 kN. 


I 50 I 


Determine: 

(a) the shear stress just above and just below X-X; 

(b) the shear stress at the neutral axis of the section. 

(c) Sketch the shear stress distribution through the 
section and state where the maximum shear 
stress occurs. 


14. Show that the difference between the maximum and 
mean shear stress in the web of an I beam is Sd I 2 /24I, 
where d is the height of the web. 

15. The outer dimensions of a channel girder section 
are 120 mm (web) X 50 mm (flanges); the web and 
flanges are 5 mm thick. Determine the position of 
the shear centre of the section. 

16. Find the shear centre of the beam cross-section 
shown in Figure Q16 (it is much smaller than R). 



Figure Q16 


17. A heat exchanger consists of a cylindrical vessel 
which contains 35 U-shaped tubes of 20 mm bore 
and 30 mm outside diameter. The ends of these 
tubes are welded into one of the flat end plates. The 
total length of the tubes within the vessel is 7 m. 
Calculate the hoop and axial stresses on the inside 
and outside of the straight parts of the tubes, remote 
from the bend and the ends, due to pressures of 10 
bar in the vessel (i.e. outside the tubes) and 100 bar 
inside the tubes. 

18. A compound tube is made up of an inner steel tube, 
4.5 in internal diameter and 6.5 in external diameter, 
on which is shrunk an outer steel tube that is 9 in 
external diameter. If the radial pressure at the 
common surface is 4000 lbf/in 2 , find the maximum 
and minimum circumferential stresses in both tubes 
due to shrinkage and plot the distribution of hoop 
stress across the wall of the compound tube. 

19. A circular saw that is 5 mm thick and has a 900 mm 
diameter has a bore of 100 mm. The steel, of which 
the saw is made, has a density of 7800 kg m -3 , and 
T] = 0.3. Find the maximum speed permitted if the 
hoop stress is restricted to 240 MPa. What is the 
maximum value of the radial stress? 

20 . A bronze gyro-wheel has a moment of inertia of 0.4 
kg nr and can be run with safety at 3000 rpm. A 
larger steel wheel of similar shape is required to give 
an angular momentum of 40 X 10 3 kgm 2 s _1 at 1800 
rpm. In what ratio must all the linear dimensions 
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be increased, and what elastic limit is required for 
the steel so that both wheels have the same factor of 
safety. (Density of bronze = 8480 kgm -3 ; Elastic 
limit of bronze = 78 MPa.) 


21 . Calculate the principal second moments of area and 
the directions of the principal axes for the section 
shown in Figure Q21. 




AH dimensions in mm 


22 . A 50 mm X 30 mm X 5mm angle is used as 
a cantilever of 500 mm, with the 30 mm leg 
horizontal and uppermost. A vertical load of 1000 N 
is applied at the free end. Determine the position 
of the neutral axis and the maximum tensile and 
compressive bending stresses. 


23 . Calculate the maximum tensile stress and the 

position of the neutral axis for the section shown in 
Figure Q23, when a bending moment of 225 Nm is 
applied about the x-axis in the sense shown. 



24 . Using strain energy, derive an expression for the 
deflection of the load point of the beam shown in 
Figure Q24. 



Figure Q24 


25 . Calculate the deflection beneath the force for the 
cantilevered bracket shown in Figure Q25. (E = 200 
GPa; G = 80 GPa.) 



Figure Q25 


26 . Derive an expression for the increase in distance 
between the ends A and B of a thin bar of uniform 
cross-section consisting of of a semi-circular portion 
CD and two straigth portions AC and BD as shown 
in Figure Q26. 

If the bar is of diameter 6 mm, R is 40 mm and is to 

p 

have a spring stiffness, -g of 1000 kg/m, show that 
the necessary length for L is approximately 210 mm. 
The bar is made from mild steel with Young’s 
modulus E = 210 GPa. 



Figure Q26 


27 . A circular connecting rod is made from a steel 
having the following properties: 

• UTS: 950 MN/m 2 

• YP: 800 MN/m 2 

• Fatigue limit: 500 MN/m 2 

The rod is subjected to a fully reversed axial load 
of 180kN. Determine the minimum rod diameter, 
allowing a factory for safety of 2, if the rod end 
produces a fatigue strength reduction factor of 2.1, 
where the stress concentration factor is 2.5. 

28 . A mild steel cantilever beam of circular cross-section 
is subjected to a load at its free end which varies 
cyclically from P to — 3P as shown in Figure Q28. 
Determine the maximum value of P if the fatigue 
strength reduction factor for the fillet is 1.85 and 

a safety factor of 2.0 is assumed. (Hint: use the 
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Goodman Diagram and apply the fatigue strength 
reduction factor to the stress amplitude only.) 



applied load 


Figure Q28 

29. A circular steel shaft having a transverse oil hole 
is subjected to a torsional load which varies from 
-100 Nm to 400 Nm (i.e. values in opposite senses). 
Determine the necessary shaft diameter, assuming 
that the hole causes a fatigue strength reduction 
factor of 1.75 and making use of a factor of safety of 
1.5. Assume the following properties for the steel: 

• Ultimate tensile strength: 400 NM/m 2 

• Fatigue endurance limit: 260 NM/ m 2 


Nj = N 3 = 20, N 2 = N 4 = 40. Find the moment of 
inertia of the system referred to the input (bottom) 
shaft, ignoring the inertia of the gears themselves. 



30. 


A rectangular section mild steel beam with the 
cross-sectional dimensions shown in Figure Q30, has 
a temperature given by: 


-50co s (I)°C 


Find the factor of safety of the beam against yield, at a 
section remote from the ends. Assume (T y = 250 MPa 
and E = 200 GPa. 



Figure Q30 

31 . A thin circular steel plate with a 400mm outside 
diameter and a 200mm insider diameter has a 
temperature distribution which varies approximately 
linearly with r. At the bore, T = 100°C, and at the 
outside T = 30°C. Find the growth of the bore and 
the growth of the outside diameter compared with 
their room temperature (20°C) values. 


Electromechanical Drive Systems 

1 . A machine may be regarded as consisting of the system 
shown in Figure Ql, where J\ = J 2 = 0.1 kgm 2 , 


(Hint: there are two ways to do this. Either work 
down the system, building up the contributions 
to the referred inertia for each axis, or treat each 
inertia separately and refer it directly to the input 
shaft. Both methods are equivalent and give identical 
answers.) 


2. Part of an x-y-z positioning system consists of a 

carriage which has a total mass of 6 kg and is carried 
on a sliding ball- assisted carriage (Figure Q2). There 
is no viscous friction and windage is negligible 
but the coefficient of (Coulomb) friction for the 
ball-slide may be regarded as 0.05. The carriage 
is attached to a continuous toothed belt, which 
passes over two pulleys of effective (pitch) diameter 
20 mm, one of which is connected directly to the 
motor that is used for positioning the carriage. 

The total mass of the belt is 50 g. Each pulley may 
be regarded as a solid cylinder with mass 10 g and 
diameter 18 mm. 
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(a) What is the moment of inertia of the system 
referred to the pulley that is connected to the 
motor? 

(b) The carriage is required to move from one 
end of the 600 mm-long bed to the other in 
1 second. Assuming that this is achieved by a 
period of constant acceleration over 300 mm, 
followed by constant deceleration over the 
remaining 300 mm, find the maximum linear 
acceleration of the carriage and hence find the 
maximum angular acceleration of the shaft. 

(c) Find the frictional force (ignore the frictional 
effect of the weight of the belt), the frictional 
torque and the total torque required from 
the motor in order to achieve the necessary 
acceleration while overcoming the friction. 

3. A mass of 100 kg is moved in a straight line using 
a lead screw of pitch 5 mm. As it is a ball-assisted 
high-quality leadscrew, its efficiency is high (95%), 
and it may be approximated as a solid cylinder of 
density 7800 kgm -3 , diameter 25 mm and length 
2 m. What torque must be applied to the leadscrew 
to accelerate the mass from rest to 0.2 m/ s over a 
distance of 0.5 m? Assume constant acceleration. 
(Hint: work out the linear acceleration of the mass 
using one of the kinematic formulae, then noting 
that 2jt radians of rotation equates to 0.005 m 

of movement, work out the angular acceleration 
needed. Work out the moment of inertia of the 
screw itself, then use equation 4.26 to obtain the 
torque.) 

4. A ‘penny-farthing’ (Figure Q4, a primitive type 
of bicycle, having a very large front wheel driven 
directly by the pedals) is being ridden by a person of 
weight 80 kg. The penny-farthing has a total mass 
of 20 kg, of which 8 kg is concentrated in the rim 
of the front wheel. The front wheel has a radius of 
0.65 m. 



Figure Q4 


(a) Calculate the inertia of the system referred to 
the axis of the wheel. Ignore the contribution to 
the rotational inertia of the front wheel spokes; 
also ignore the rotational inertia of the very small 
rear wheel. 


(b) Calculate how much torque is required at the 
pedals to obtain an acceleration of 0.5 m/s 2 : 

(i) when the penny-farthing is being ridden on 
a level surface; 

(ii) when it is climbing a gradient of 1 in 50 or 
2% (equivalent to an angle of inclination of 
0.02 radian). 

5. A vehicle has mass 1600 kg. This total includes four 
wheels, each of mass 15 kg and diameter 0.7 m, 
which may be regarded for inertia purposes as solid 
discs. 

(a) What is the moment of inertia of each wheel 
about its own axis? 

(b) What is the moment of inertia of the vehicle 
referred to the axis of the wheels? 

6. A mixing head for stirring a slurry has the following 
torque-speed characteristic when mixing is taking 
place: 

L = 100 + w + O.lw 2 

where L is the torque in Nm and oo is the angular 
velocity in rad/s. 

It is driven via a 50:1 worm gear of efficiency 45% 
from an induction motor with the following torque- 
speed characteristic in its operating region: 

L' = 0.1(100 it - «') 
where L' and oo' are again in Nm and rad/s 
respectively. 

(a) What is the torque-speed characteristic of the 
mixing head, referred to the axis of the motor 
i.e. expressed as L' vs oo'? 

(b) At what speed will the motor run when driving 
the mixing head? Express your answer in rad/s 
and rev/min. 

(c) At what speed (in rad/ s and rev/ min) will the 
mixing head itself rotate? 

(d) What power does the motor produce when 
driving the mixing head under load? 

7. A floating container is pulled along a channel of 
water using a chain, which is hauled using a winch 
of diameter 300 mm. The drag force F on the 
container is related to its speed by the following 
equation: 

F = 500 v 2 

where Eis in N and v is in m/s. 

The winch is to be driven from a petrol engine with 
the following characteristics: 

L' = 1 + 0.06(i)' — O.OOlco' 2 
which is valid within the range 100 < to <450 rad/s. 
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(a) Refer the force-speed characteristics of the 
container to the axis of the winch. 

(b) Hence, refer the force-speed characteristics of the 
container to the axis of the engine for the following 
combinations of gear ratio and efficiency: 

(i) 30:1,65% 

(ii) 40:1,55% 

(c) In both cases, find the combination of torque 
and angular velocity at which the engine will run 
when driving the winch. Hence, decide whether 
either of these combinations is feasible, and which 
will make the more effective use of the system. 

8 . A 220V dc series motor has the flux vs, current 
characteristic: 


Flux(mWb) 

1 16.5 

22.3 

24.4 

25.0 

Current (A) 

| 20 

30 

45 

60 


The armature resistance is 0. 1 Q and the field 
resistance is 0.2 fl. 

The motor runs at a speed of 10 rev s -1 and draws a 
current of 50 A. 

Calculate: 

(a) emf induced in the armature; 

(b) design constant, k. 

The current is reduced to 30A. 

(c) Calculate the new speed. 

9 . A 240V dc shunt motor has an armature resistance 
of 0.6 £2 and field winding resistance of 160 fl.The 
motor field characteristic (<|> versus field current) is 
shown in Figure Q9. 



Calculate: 

(a) field current. 

Given that the motor drives a constant load torque 
of 10 Nm, calculate: 

(b) armature current; 

(c) speed. 


10. A separately excited motor is rated at 25 kW, 400V, 
70 A and 1000/1500 rev/min. It is operated with 
the rated field voltage at speeds below 1000 rev/min 
and with field weakening above 1000 rev/min. 

(a) Calculate the motor constant and the armature 
resistance R a . 

(b) Calculate the field voltage, armature voltage 
and armature current if the motor is running 
at its rated power and at its maximum speed 
of 1500 rev/min, taking account of armature 
resistance. 

(c) The motor is used to drive a load with the 
torque-speed characteristic T = 25 + w, where 
the units of T are Nm and those of to are 
rad/s. It is desired to drive this load at 800 rev/ 
min. Calculate the required armature and field 
voltages and the armature current drawn, 
taking account of the effects of armature 
resistance. 

11. A 220 Vdc series motor with an armature resistance 
of 0.4 £2 and a field resistance of 0.267 £2 drives a 
load at a speed of 520 rev min -1 . The supply current 
is 15 A. 

(a) Calculate the load torque. 

(b) The load torque is doubled and the supply 
current rises to 35A. Calculate: 

(i) the new speed; 

(ii) the mechanical output power. 

12 . A 25 kW, 415 V, 50 Hz, 1440 rev/min cage 
induction motor is fed from a variable-frequency 
supply. The voltage and frequency are varied in 
proportion, with rated frequency corresponding 
to rated voltage. This arrangement is used to 
drive a blower which has a torque-speed 
characteristic given by T = 6 X 10 -3 w 2 , where T 
is in Nm and to is in rad/s. Determine the frequency 
and line-to-line voltage at a blower speed of 1300 
rev/min. 

13. (a) Explain the reasons why the use of a pneumatic 

motor may in some situations be preferable 
to the use of an electric motor. Illustrate your 
argument with proper graphs and examples 
where appropriate. Explain the shortcomings of 
pneumatic motors. 

(b) Draw a diagram illustrating the various features 
of a compressed air system, such as may be 
found in a typical factory. Briefly identify the 
function of each component and explain how 
the air pressure in the system is controlled. 
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Feedback and Control Theory 

1 . (a) Figure Ql(a) shows a multi-loop system. Determine the overall transfer function of the 
system shown in Figure Ql(a), relating the output Y a (s) to the input X,(s) 



Figure Q1(a) 


(b) Figure Ql(b) and Figure Ql(c) show two 

block diagrams for the same system. Determine 
the transfer functions G(s) and FI(s) of the 
block diagram shown in Figure Ql(c), that are 
equivalent to those of the block diagram of 
Figure Ql(b). 



Figure Q1(b) 



Figure Q1(c) 


2 . Figure Q2 shows a system for controlling the level 
of liquid in a tank. The tank has a fixed cross- 
sectional area A, fixed linearized flow resistance R 
for the outflow (relating the outflow to the liquid 
level) and variable liquid level h(t). For this system, 
the difference between the actual level h in the tank 
and the desired level h, is used to form the error 
signal e(f), where the actual level h is measured by 
a transducer. This error signal is fed to a controller 
that drives a variable speed pump such that the 
volumetric inflow rate in the tank is given by 
Q,(i) = G c (s) E(s), where G c (s) is the transfer 
function of the controller and E(s) is the Laplace 
transform of the error signal t(t). In addition, there 
is an uncontrolled disturbance inflow to the tank 
which is given by q d (t). 


For the case when the controller only contains 

proportional action with gain K: 

(a) Draw the block diagram of the system, taking h s 
to he the input and h to he the output. 

(b) Determine the overall transfer function relating 
h to and q and show that the system is of first 
order. 

(c) Show the location of the closed loop pole of the 
system in the 5-plane. 

(d) With q d = 0,i£A = 2,R = 10_and K = 0.3 and 
a constant demand level = h„ in consistent 
units, find the steady-state response in terms of h v 





3 . Figure Q3 shows a schematic of a system for 
controlling the angular velocity fl of a cable 
drum of radius R under conditions where the 
cable tension T is variable. The drum, which has 
a moment of inertia Jd is driven via n: 1 speed 
reduction gearing by a servo-motor with rotating 
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parts having a moment of inertia J M . Viscous friction 
of coefficient C resists the rotation of the drum. 

The feedback signal V 0 is derived from a tacho- 
generator and is subtracted from the demand signal 
K c fl n to form the error signal. The controller is 
a simple proportional controller with gain K and 
delivers current I to the servomotor that develops 
a drive torque L.The transfer functions for the 
tacho-generator and servo-motor, respectively, are as 
follows: 

V Q (s) _ K g Us) _ K m 
ft(s) 1 + T G s ; I(s) 1 + T m s 



friction C 

Figure Q3 

(a) Draw a block diagram for the system and derive 
the overall transfer function relating the drum 
speed ft to the demand speed ft r) , and the cable 
tension T. 

(b) For the case when T G = 0, determine the 
order of the system and derive expressions for 
appropriate parameters which characterize 
completely the transient behaviour. 

4. The block diagram in Figure Q4 represents a system 
for controlling the speed of a rotor that is driven by 
a field-controlled electric motor. A tacho-generator 
is used to provide the feedback signal. Tm> T h , and 
T x are time constants associated with the motor, 
rotor and tacho-generator respectively, and K is 
a gain associated with the motor. For a particular 
system, the time constants have the following 
numerical values: T M = 0.5, T t = 1 and T x = 0.1. 

(a) Find the minimum value of K for which the 
steady state error in rotor speed (ft; — ftj 
following a step change in input is 2% or less. 

(b) Find the maximum value of K for which the 
system is stable. For this maximum value of K 
determine the frequency at which the system 
would oscillate if disturbed. 


Motor Rotor 



Tacho-Generator 


Figure Q4 

5. The block diagram in Figure Q5 represents a system 
for controlling the rotational speed of an inertia 
load that is driven by a diesel engine. K and T c 
are respectively the gain and time constant of the 
fuel injector. The demand speed is «,(<) and the 
actual load speed is co Q (f) (these are expressed in the 
Laplace space as ft;(.s) and ft D (s) respectively). 

(a) Derive an expression for the maximum value of 
K for which the closed loop system is stable. 

(b) Show that the steady state error in speed a);-a> 0 
following a step change in demand is zero and 
derive an expression for the steady state error 
when a ramp input is applied. 

(c) With reference to your answers to parts (a) 
and (b), explain whether it is better for the 
fuel injector time constant T c to be shorter or 


Questions 


Fuel 

Injector Engine Load 



Figure Q5 


6 . (a) Figure Q6 shows the block diagram of a system 

for controlling the temperature of liquid in a tank. 
For the case where the controller is a proportional 
controller with transfer function G(s) = K, draw 
the root locus plot for the system. 

(b) To reduce steady state errors, the controller is 
modified to incorporate integral action so that 
the controller transfer function is now given by: 

Gc« = K(^±l) 

Draw the root locus plot for the modified system 
and comment on the practical significance of the 
differences between the plots for the original and 
modified systems. 


Controller Tank + liquid 



Thermometer 


Figure Q6 


7. Figure Q7 shows a schematic of a system for 
controlling the angular position (expressed in the 
Laplace domain) 0 o (s) of a satellite dish. The dish 
assembly has a moment of inertia J D and is driven 
via n: 1 speed reduction gearing by a servo motor 
with rotating parts having a moment of inertia J M . 

A viscous friction of coefficient c resists the motion 
of the dish assembly. The feedback signal V Q (s) is 
derived from a potentiometer and is subtracted from 
the demand signal KpG^s), to form the error signal. 
The controller is a simple proportional controller 
with gain K and delivers current / to a servo-motor 
that develops a drive torque L M (s). The transfer 
functions for the potentiometer and servo-motor, 
respectively, are as follows: 

Vo(s) _ Lm(s) _ Km 
0 o (j) p; i(s) i + t m s 

(a) Draw a block diagram for the system and derive 
the overall transfer function relating the angular 
position of the dish 0 o (s) to the demand rotation 


(b) For the case when Tm ~ 0. determine the 
order of the system and derive expressions for 
appropriate parameters which characterize 
completely the transient behaviour. 



8 . Find the numbers of positive roots of the following 
equations using the Routh-Hurwitz criterion: 

(a) i 3 + 4s 2 + s - 6 = 0 

(b) 5 4 - 5 3 - 2s 2 - 2s + 4 = 0 

(c) j 5 + 4 s 4 - 14s 3 - 46s 2 + 25s + 150 = 0 

9. Plot the closed loop root loci for the situations 
where the open loop transfer function is given 
by following transfer functions combined with a 
variable gain K: 

w (s 2 + 2s + 2)(j + 2) 

^ (s + l)(s + 2)(j + 3 )(s + 4) 

Hence, for each case, determine the value of gain K 
for marginal stability, and (if appropriate) the natural 
frequency at marginal stability. 
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Structural Vibration 

Natural frequencies and mode shapes 

1 . Derive the equation of motion and hence find the 
natural frequencies for the system shown in Figure 
Ql. 

Uniform rigid beam, mass m 



2. A wheel (radius r, mass m, moment of inertia about 
its centre I) can roll without slipping on a horizontal 
plane. It is restrained by a horizontal spring (stiffness k) 
attached at one end to the centre of the wheel and 
at the other end to a rigid vertical wall, as in Figure 
Q2. Derive the equation of motion and hence find 
the natural frequency for the system. 

Figure Q2 

What would the natural frequency be if there was 
no friction between the wheel and the plane? 


3. Derive the equations of motion for the system in 
Figure Q3. Assume that all displacements are small. 



Figure Q3 


4. Find the natural frequencies and mode shapes for 
the system in Figure Q3 when fe, = 10 kN/m, 
k 2 = 30 kN/m, m 1 = m 2 = 5 kg and m 3 = 10 kg. 


5. Derive the equations of motion for the system in 
Figure Q5. Assume that all displacements and angles 
are small. 



6 . Derive the frequency equation for flexural vibration 
of a uniform beam that is free at both ends and find 
an expression for the mode shape function. 

7. A 25 mm diameter shaft, 1.5 m long, is held by 
two roller bearings at one end (giving a ‘clamped’ 
boundary condition) and by a self-aligning ball 
bearing at the other end (giving a ‘pinned’ boundary 
condition). Using the roots of the appropriate 
frequency equation given in Table 6.3 on page 398, 
find the first three critical speeds of the shaft. 

Response of damped single-degree-of-freedom 
systems 

8. If a heavily damped structure is given an initial 
displacement Z 0 and then released from rest, find 
the constants of integration and sketch the graph of 
z(t) against time. 

9. A critically damped structure is subjected to an 
impulse such that it acquires an instantaneous initial 
velocity, V Q , while the displacement remains zero. 
Show that the subsequent displacement is given by 
z(t) = V 0 t e““«‘. 

10. The rigid beam shown in Figure Q10 has a moment 
of inertia of 10 kgm 2 about the pivot at A. End C is 
displaced downwards by 10 mm from its equilibrium 
position and then released from rest. Find the 
maximum upward displacement of C from its 
equilibrium position and the elapsed time at which 
this occurs. 



Figure Q10 
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Questions 


1 1 . Figure Q1 1 shows a rocker arm with moment of 
inertia I Q , about the pivot at O. Rubber blocks 
at ends A and B can each be modelled as a spring 
(stiffness, k) in parallel with a viscous damper 
(damping coefficient, c).The base of the block at 
A is attached to a rigid foundation. The base of 
the block at B is attached to a follower, which is 
driven by a cam that gives the follower a sinusoidal 
displacement of amplitude Y, and frequency «. 
Derive an expression for the steady-state amplitude 
of the displacement at A, assuming that the angular 
displacement of the bar is small. 



12. For an undamped system with n degrees of freedom, 
show that the steady-state response in coordinate 
j due to a sinusoidal force of amplitude P and 
frequency w applied in coordinate k is given by 



Approximate methods 

13. Use Dunkerley’s Method and Rayleigh’s Method 
to estimate the lowest natural frequency of the 
torsional system shown in Figure Q13. 

nh f i f i 

Figure Q13 

14. A shaft with universal joints at each end has a length 
of 6 m, a second moment of area of 0.00025 m 4 and 
a mass/unit length of 75 kg/m. It carries three discs, 
which can be regarded as point masses of 100, 150, 
and 200 kg located 1.2, 3 and 4.8 m from the left- 
hand end. Estimate the lowest critical speed using 
Dunkerley’s and Rayleighs Methods. Take E = 207 
GN/m 2 , p = 7800 kg/m 2 . 


15. Find a single-degree-of-freedom approximate model 
to analyse the motion of the 1 kg mass in the system 
in Figure Q15 when it is vibrating near its lower 
natural frequency. 



16. Use the model from Question 15 to estimate the 
steady-state response of each mass in the two- 
degree-of-freedom system due to a sinusoidal force 
of amplitude 10 N and frequency 3.8 Hz applied to 
the 1 kg mass. 

Vibration control techniques 

17. Derive the displacement transmissibility for the 
system in Figure Q17. 



Figure Q17 


18. A compressor of mass 300 kg is to be installed on 
four isolators, two at each end. The centre of mass 
is 0.4 m from end A and 0.2 m from end B. In the 
end elevation, the isolators are located symmetrically 
with respect to the centre of mass. 

Isolators are available with stiffnesses of 40, 70, 

110, 180 and 290 kN/m and each has a maximum 
allowable static deflection of 10 mm. Select suitable 
isolators for the installation so that the isolation 
efficiency is at least 70% at the normal running 
speed of 870 rev/ min. Estimate the actual isolation 
efficiency for each of the isolators you select. 
Neglect damping. 
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absolute humidity 59—60 
ac induction motors See induction 
motors 

adiabatic process 51, 96 
AFR 76-77, 78-79, 133 
air conditioning 46, 57—68 
principles of operation 63 
air-fuel ratio (AFR) 76-77, 78-79, 
133 

air gap flux 297 
air rich combustion 77 
air standard Diesel cycle 132 
air standard Otto cycle 131, 132 
aircraft 313, 392-394 
wings 395, 404 

Amagat s law of partial volumes 7 1 

amplifiers 341 

amplitude of response 413 

angle of arrival 369 

angle of departure 369 

angle valve 30 

angular velocity, Laplace transform 
340 

antenna, high-speed rotating 263 
antenna system See radar antenna 
apparent gas constant 73 
apparent molar mass 70, 73 
Argand diagram 361 
armature 268—269 

emf induced in winding 270—272 
asymmetric channel section, shear 
stress distribution in 190—191 
asymmetrical bending 207—217 
See also second moments of 
area beams with asymmetric 
sections 213-217 
atmospheric air 59 
automatic valves 91 
automotive applications, dc motors 
284 

autopilot 317 
Avogadro number 59 
axisymmetric conduction 101 


back pressure turbine 129 
‘bank run’ 321 
base speed 299 
baseball 21, 23 

battery-operated appliances 284 
BDC (bottom dead centre) 91 
beam, with added masses 442-443 
See also uniform beams 
beam bending, rectangular section 
withEPP 161-163 
beam bending equation 150, 207 
beam— columns 178—183 
beam deflection 223 
bending moment 395—396 
bending stiffness 382 
bends, flow around 28—29 
benzene 79-80 
best-efficiency point (BEP) 45 
bifurcation point 170 
black body 105 
black body heat transfer 106 
black body view factor 106 
Blasius profile 10 
block on cantilever beam 
torsional vibration 382—383 
vertical vibration 382 
block diagrams 322 
building 337-347 
components 337—343 
systems 344-347 
conversion to transfer function 
348-350 

fibre tensioning system 325, 
332-334 

heater controller 322—323, 
331-332,348 

with overlapping control loops 
350-351 

radar antenna 346, 349-350 
and s domain 331—334 
blowers 263, 282, 314, 342-343 
bluff bodies, drag 20-22 
Bode plot 360-361 


bottling plant 306 
bottom dead centre (BDC) 91 
bounce, and pitch 389-392 
bounce mode 388 
boundary conditions, basic types of 
support 397 
boundary layer 1, 8—18 

formulae for development 15 
Reynolds number 8—9 
tripping 18, 21 
velocity profile 10—12 
boundary-layer equations 16—17 
brake horsepower 42, 44 
brake mean effective pressure 267 
brake power (bp) 134 
brake specific fuel consumption 135 
brake thermal efficiency 135 
brakes 264 
breakin points 368 
breakout points 368 
brittle materials, failure 145—146 
brushless dc motors 285 
Buckinghams theorem 37—39 
buckling 168-176 

compressively loaded rod 180 
buildings, tall 395 

calorific values 85 
cameras 318 
camshaft 133 
cantilever beams 225—226 

See also block on cantilever beam 
clamped-free 400-402 
forced response 444-446 
with mass at free end 402—404 
mode shape estimation 441—442 
capacity coefficient 44 
capacity rate 115 
caravan, single-axle 385—389, 
407-408 

carbon brushes 268, 269 
carburettors 136 
Carnot cycle 53, 121-122 
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Carnot efficiency 55, 121 
Castigliano, Carlo Alberto 217, 221 
Castigliano s theorem 217, 

220-222 

cause, and effect 378 
cavitation number 34 
centrally loaded beam 159 
centrifugal blower 252 
centrifugal governor 318—319 
centrifugal pumps 42-45, 263 
cetane 132 
cetane number 132 
char 82 

characteristic equation 357, 359, 
366-367 

characteristic function 359 
characteristic length 104 
characteristic polynomial 357 
Chernobyl nuclear disaster 321 
choke 318 

circular cylinder, flow around 19—20 
circular shafts, torsion 163-167, 219 
clamped-clamped beam 157—159 
clearance volume 91 
closed feed heaters 126, 128-129 
closed-loop feedback control, cruise 
control example 319-320 
closed-loop speed control 300 
closed systems 49 
clutches 264,316 
coefficient of drag 262 
coefficient of performance 68 
coefficient of thermal expansion 239 
coil spring isolators 447-448 
combined heat and power 129—130 
combined loading 139—143 
bending and axial load 140 
bending and torsion 141 
methodology 142 
pressure, axial and torsional 
loading 141—142 

combined mode heat transfer 101, 
108 

combustible gases 131 
combustion 74—90 

closed system 82-83, 85-87 
complete 76 
incomplete 76 

non-standard beginning and end 
conditions 85—87 
non-stoichiometric 75—76 
open system 83-84, 87 
stoichiometric 75—76 
combustion energy 82 
combustion gases 46 


combustion reactions 46 
commutator 269 

compatibility, principle of 402, 403 
complementary energy 221 
complementary function 412 
complete intercooling 95 
compound drive trains 252 
compound motors 273, 282-283 
cumulative 283 
differential 283 
compression 

double-acting 90, 95 
heat transfer to jacket 93 
multistage 90, 94-95 
single-acting 90, 95 
single-stage 90 
work done during 92-93 
compression ignition (Cl) 132 
compression ignition engine 132, 
133 

compression stroke 133, 134 
compressors 54, 263 

double-acting two-stage 95 
efficiency 93-94 
reciprocating 417—418 
single-acting two-stage 95 
condensing boilers 78 
condensing hot reservoir 66 
conduction 97-98 
axisymmetric 101 
conservation of energy 49 
law of 47, 251 
constant life diagrams 230 
constant pressure heat addition 132 
constant of proportionality 341 
constant volume heat removal 132 
constant volume heating 132 
continuity equation 5—6 
control algorithm 320, 352 
control systems 

dynamic response 356—360 
frequency response 360-361 
root locus method applied to 
366-372 
stability 362-364 
control volume 47-48 
controller, transfer function 343 
convection 99-102 
mechanism 102 

convective heat transfer coefficient 
99 

correction factor 113 
correlation 102 
Coulomb friction 259 
counter flow 109, 112 


coupled equations 387 
crack extension force 233 
crack initiation 227 
crack propagation 227 
crack tip driving force 233 
crippling load 171, 177, 178 
critical (buckling) load 170, 179 
critical crack extension force 
(toughness) 233, 234 
critical Reynolds number 7 
critical speeds 395 

critical strain energy release rate 233, 
234 

critical stress intensity factor 
233-234 

critically damped system 357, 358 

cross-flow 113-115 

cruise control 317, 319—320, 343 

cup and cone failure 144 

curvature 150 

cycle efficiency 53 

cycles 48 

cylinders 

See also thick cylinders 
circular, flow around 19-20 
thin 195 

damage tolerant approach 228 
damped multi-degree-of-freedom 
systems 424-431 
forced response 429-431 
frequency response function 
(FRF) 430-431 
modal seating 425—426 
orthogonality of modes 424-425 
damped natural frequency 410, 417 
damped single-degree-of-freedom 
systems 405-423 
equations of motion 405-408 
estimating damping 412 
free vibration 408—411 
critical damping 410 
high damping 409 
tight damping 410 
zero damping 409 
frequency response function 
(FRF) 415-418, 

420-421,423 

harmonic excitation 412—415 
periodic excitation 418-423 
damped vibration absorber 455 
damper 339 
damping 405 
critical 410 
estimating 412 
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high 409 

identifying experimentally 431 
light 410 
proportional 430 
damping coefficient 405 
damping ratio 358, 410 
Darcy friction factor 26 
Darcy— Weisbach equation 26 
dashpot 339 
dc motors 268-285 

armature equivalent circuit 272 
construction 268-269 
emf induced in armature winding 
270-272 

operation 269-270 
rectified supplies for 285-292 
torque 272, 340-341 
transfer function 340 
types 273-285 
dc output voltage 285-290 
dc series motor 273, 278-282 
dc servomotors 284 
dc shunt motor 273, 274-278 
speed control 276-278 
torque-speed characteristics 
275-276 
decibels 364 
deflection curve 149 
deflector 25 
degrees of freedom 377 
density 83-84, 137,216 
derivative time constant 352 
deviatoric planes 148 
devolatilization 81 
dew point 60 
diagonal matrices 426 
diesel 132 
Diesel cycle 132 
Diesel engines 134,267 

performance assessment 134-137 
Diesel knock 135 
differencing junctions 322,325, 

343 

differential equation of elastic line 
150 

dimensionless numbers, in 

convective heat transfer 
103-104 

dimensions, of physical variables 38 
diode bridge rectifier 286-287 
three-phase 288-289 
direct current motors See dc motors 
discontinuities 150 
discrete Fourier transform (DFT) 

419 


disk valve 29-30 
displacement 407 
displacement thickness 14 
displacement transmissibility 450 
door, sprung 358 
drag, on immersed bodies 19—25 
drag coefficient 15, 19, 34, 36-37 
three-dimensional bodies 22 
two-dimensional bodies 22 
drag force 260 
drive ratio 251 

drive shaft, aero engine 392—394 
dry air 59 

dry bulb thermometer 61 
dry products 78 
ductile materials 
failure 144-145 
yielding 146 

dummy load 222, 224-225 
Dunkerley, Stanley 437 
Dunkerley s method 437-439 
dynamic friction 259—260 
dynamic response 356—360 
dynamic similarity 40 
dynamically equivalent systems 443 

earthquakes 395 

eccentrically loaded struts 176—178, 
181 

effect, cause and 378 
effective length 178 
effective radius 252 
effectiveness-number of transfer 

units (s-NTU) 110,115-118 
efficiency, of pump 44 
eigenvalue problems 387-388 
elastic crack tip stress fields 234-235 
elastic instability 168-184 
elastic line 149 

differential equation of 150 
elastic modulus See Young’s 
Modulus 

elastic-perfectly plastic (EPP) 
159-160 

elastic-plastic deformations 144-146, 
159-168 

elastic-plastic material behaviour 
models 159-161 
elastic unloading 164 
elastomeric isolators 447 
electric mixers 282 
elliptical hole, in large plate 231-232 
emission, heat 105 
emissivity 105 
endurance limit 229 


energy audit 48 
energy balance 110 
energy budget 48 
energy conservation See 

conservation of energy 
energy inventory 48 
engines 266 
enthalpy 50, 69 

enthalpy of combustion 83—84 
enthalpy of formation 84 
entropy 55, 70 
equation of state 50 
equilibrium 48 
of forces 402 
of moments 403 
states 168 
error 319, 352 

Euler buckling load 171, 177, 178 
Euler number 34 
evaporating cold reservoir 66 
evaporative cooling 58 
excess air ratio 77 
excitation function 408 
exhaust stroke 134 

experimental modal analysis 431-436 
extensive properties 50 

factor of safety 231, 376 
Fanning friction factor 26 
fans 263 
fatigue 227-231 
life analysis 228 
fatigue crack growth 237-238 
fatigue life 229 
fatigue limit 229 
fatigue notch factor 230 
fatigue strength 229 
feed heaters 126, 128-129 
feedback 318 

fibre tensioning system, modelling 
323-326, 332-334 
field current 269 
final-value theorem 337 
finite element method 404 
fireworks 75 

first law of thermodynamics 49—50 
five-degrees-of-freedom system 
423 

fixed— fixed strut 173 
fixed— free strut 172—173 
fixed— hinged strut 173—174 
flange 

horizontal shear in 189 
transverse shear in 188—189 
flash chambers 128 
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fiat belt systems 251-252 
flexural rigidity 150 
flow restrictions 342-343 
flow reversal 17 
flow separation 17 
flow separation point 17 
flow work 50 
fluids, in motion 167-185 
flux linkage 271, 298 
flywheel 339-340 
food processors 282 
force excitation 415 
force transmissibility 449 
forced convection 99 
fouling factors 118 
four-pole induction motor 294 
four-stroke cycle engine 132 
Fourier series 295, 419 
Fourier’s law 97 
fracture 227 

fracture mechanics 231-238 
energy approach to 232-237 
fatigue crack growth 237-238 
fracture toughness 233-234, 235 
fre e-body diagram 379 
free convection 99 
free-fixed strut 172-173 
free-free beam, impact test 434-435 
frequency, units 381 
frequency equations 388, 397, 399 
numerical values of roots 398 
for particular end conditions 398 
frequency response 360-361 
frequency response function (FRF) 
415-418, 420-421,423, 
430-431 

frequency response function testing 
432-435 

friction, positive aspects 264 

friction-based drive systems 251-252 

friction drag 19 

friction factor 25-27 

friction rollers 251-252 

friction velocity 12 

frictional effects, summary 262 

frictional head loss 28 

frictional losses 259-262 

Froude number 5, 34 

fuel injection systems 136, 267 

fuel oils 132 

fuel rich combustion 77 

fuelling systems 136 

fundamental 295 

gain 322, 341,352 


gain margin 364 
gas constant, for mixture 69 
gas mixtures 69—74 
gas turbine engine 392—394 
gas and vapour compressors 46 
gate valve 29-30 
gear ratio 250, 253 
geared systems 250-256 
efficiency 253 

influence on apparent inertia 255 
inertia 254 
general equation 397 
generalized Hooke’s law 197, 239 
geometric similarity 40 
Gerber parabola 229 
Gibbs-Dalton law 58-61, 69 
globe valve 29-30 
golf ball 21 

Goodman diagrams 230 
Grashof number 34, 36, 104 
gravimetric analysis 69—71 
gravitational load 263 
gross (higher) CV at constant 
pressure 85 

gross (higher) CV at constant 
volume 85 

ground source heat pumps 68 

harmonic excitation 412-415 
harmonics 295 
head 342 

head coefficient 44 
heat engines 53 
heat exchangers 109—119 
rating 111 
sizing 111 
heat pumps 66-68 
heat removal 132 
heat transfer 96-108 
calculation 110 
gas mixture 74 
maximum possible 115 
steady-state 97 

heat transfer coefficient 98, 103 
heater controller 321—323, 331—332, 
348,355 

Heaviside step function 329, 335 
helicopters 392, 395 
45° helix failure 145—146 
hinged— hinged strut 170-172 
with initial curvature 175—176 
hoists 264 

Hooke’s law 144, 338 
generalized 197, 239 
‘howling’ 362 


humidity 58, 59 
absolute 59-60 
relative 60 

hydraulic cylinders 309 
hydraulic diameter 31 
power supplies 310 
hydraulic jack 308 
hydraulic motors 309-310 
hydraulic systems 308—311 
pneumatic systems vs. 308 
hydraulic variable-ratio drives 
310-311 

hydraulically smooth 12 
hydrocarbons 75 
naming 76 
hydrostatic line 147 
hydrostatic stress 147 
hygrometry 61 

I-section, shear stress distribution in 
187-189 

ideal gas See perfect gases 
ideal intermediate pressure 96 
ideal struts 170-176 
ideal work done 92 
imaginary boundary 48 
imaginary unit 360 
impact test 434-435 
impedance 296 
impedance matching 311 
indicated mean effective pressure 
(imep) 134 

indicated power (ip) 134 
indicator diagram 91 
induction motors 292-305 
See also torque 
approximate characteristics 
301-304 

torque 299, 302-303 
torque-speed characteristics 
299-300, 302-303 
induction stroke 91, 133, 134 
inductive reactance 296 
inertia 248 
scaling 254 
inertia base 452 
initial-value theorem 335—336 
instability 321 
insulating heat 46 
integral time constant 352 
intensive properties 50 
intercooling 95 

internal combustion engines 267, 
312 

reciprocating 131—137 
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internal energy 49 

of combustion 82-83, 84 
inverters 294-295 
three-phase 297 
ironless motors 284 
irreversible processes 49 
isentropic compression 132 
isentropic efficiency 55, 126 
isentropic expansion 132 
isentropic process 51 
isentropic work 55 
isobaric process 51 
isochoric process 51 
isolation efficiency 451 
isolation region 451 
isolators 

coil spring 447-448 
elastomeric 447 
pneumatic vibration 447 
selection 451-453 
isothermal compression 93 
isothermal efficiency 93 
isothermal flows 2 
isothermal process 51, 96 
isotropic hardening 160 

X-factor 28-29 
Karman s momentum integral 
equation 14 

kinematic hardening 160—161 
kinematic viscosity 4, 103 
kinetic energy, maximum 439 
knock 135 

Lame’s constants 198 
laminar boundary layer 10 

formulae for development 15 
laminar flow 7—8 
Laplace transforms 326—330 
of commonly encountered 
functions 327-328 
definition 327 
linearity 328 
rules obeyed 328-329 
in solving differential equations 329 
laser instruments 446 
lathe 316 

laws of thermodynamics 
first 49-50 
second 49, 53-54 
corollaries 53 
length scale 4, 16 
lift coefficient 34, 36-37 
linear elastic fracture mechanics 
(LEFM) 231-232 


linear inertia 248 
linear systems 381 
linearization 342-343 
LMTD 110-115 
load characteristics 248 

See also steady-state characteristics 
of loads 

local Nusselt number 103 
locomotives 313 
logarithmic mean temperature 

difference (LMTD) 110-115 
logarithmic velocity profile 1 1 
long-shunt motor 273, 282 
lumped mass-spring systems 
384-392, 443-444 

Macaulay, WH. 151 
Macaulay brackets 152 
Macaulays convention 152 
Macaulays method 151-153 
summary 156 
Mach number 34, 36 
machine cycle 91 
magnetic flux density 297 
magnetomotive force (mmf) 282- 
283, 297 

mass, transfer function 337—338 
mass balance 5 

mass fraction, and volume fraction 
72, 73-74 

mass-spring-damper systems 357— 
359, 365-366, 406, 448-451 
mass-spring systems 379—382 
lumped 384-392, 443-444 
maximum sheer strain energy (von 
Mises) yield criterion 146 
maximum sheer stress (Tresca) yield 
criterion 146 

maximum static deflection 453 
Mead, Thomas 318 
mean stress, effect on fatigue life 229 
mechanical efficiency 135 
mechanical power, sources 266—268 
metal deformation 73-74 
Millennium Bridge 377, 429, 454 
minimum advance for best torque 
(MBT) timing 136 
minimum static deflection 453 
minor losses 28 
mitre bend 28-29 
mixed stream 115 
modal coordinates 426 
modal damping matrix 430, 455 
modal mass 425—426 
modal mass matrix 425 


modal matrix 425 
modal parameters 431 
modal seating 425-426 
modal space 426 
modal stiffness 425 
modal stiffness matrix 425, 426 
mode shape 385, 388, 398, 431 
estimation 440 
model testing 40-41 
modes of vibration 378 
modified Goodman tine 229 
Mohrs circle 139-140, 209, 212 
molar analysis 71—74 
molar fraction, and volume fraction 
73 

molar reaction equations 75 
molecular mass 59 
moles 58, 72 

Moltier diagram 126, 127 
moment of inertia 248-250, 382 
concentrated mass 249 
solid cylinder 249 
thin-walled tube 249 
momentum integral equation 13—16 
momentum thickness 13 
Moody chart 12, 26 
motor field characteristic 274 
motors 266, 268 
multipass 113 
multipoint systems 136 
multistaging 94-95 

nameplate data 302 
natural convection 99 
natural frequency 358, 379, 381- 
382,431 

damped 410, 417 
estimating lowest 437-442 
undamped 409, 417 
Navier— Stokes equations 2—5 
negative feedback 320, 343 
net (lower) CV at constant pressure 
85 

net (lower) CV at constant volume 
85 

net pump head 42 
neutral equilibrium 168 
Newtonian flow 2, 4 
Newtonian fluids 260 
Newtons law of cooling 99 
Newtons second law of motion 
248, 337, 378 
rotational version 249, 378 
no-slip condition 1 
nodal tines 431 
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non-dimensional numbers 34-37 
non-slip condition 8, 14 
non-stoichiometric combustion 
75-76 

normalization 426 
notches 230 
NTU 115 

number of heat transfer units (NTU) 
115 

Nusselt number 103, 104 
Nyquist plot 361, 363-364 
Nyquist stability criterion 363—364 

octane 131 
octane number 131 
Ohms law 285 
one-seventh law 10 
open feed heaters 126, 128 
operator D method 329 
order of magnitude analysis 17 
orifice plate 342 

orthogonality of modes 424-425 
Otto cycle 131-132 
overall heat transfer coefficient 101, 
112 

overdamped system 357, 358 
overhead power lines 455—456 
oxidizers 74—75 

p-h diagram 67 
pancake motors 285 
parachute 24 
paracyclic machines 48 
parallel axis theorem 207 
parallel flow 109, 112 
Paris equation 238 
partial pressures 
law of 58-61, 69 
and partial volumes 71 
partial volumes 
law of 71 

and partial pressures 71 
particular integral 412 
pass-out turbine 129-130 
per unit slip 293 
perfect gases 51 
periodic excitation 418—423 
permanent magnet motors 273, 
284-285 

perpendicular axis theorem 209 

petrol 131 

phase angle 414 

phase diagrams 414 

phase margin 364 

phase relative to excitation 413 


physical space 426 

tt theorem 37-39 

PID control 352-354 

pipes, flow around bends in 28—29 

piston-engined aircraft 313 

pistons 309 

pitch, bounce and 389-392 
plane strain 233 
plane stress 233 
plastic collapse 180 
pneumatic cylinders 306 
pneumatic systems 306—307 

efficiency and energy utilisation 
issues 307 

hydraulic systems vs. 308 
pneumatic vibration isolators 447 
pneumatic wrenches 306 
pneumatics 305—308 
point bending moment 154—155 
poles 366, 369 
polytropic compression 93 
poly tropic process 51 
positioning system 344-345, 
352-354, 372-373 
positive displacement pumps 90, 263 
positive feedback 321 
potential energy 

concave function 169 
convex function 169 
power coefficient 44 
power sources, matching of loads to 
313-316 

power stations 47, 392 
power stroke 133, 134 
power tools 282 
Prandtl number 34, 103 
Prandtl’s boundary-layer equations 
16-17 

pre-ignition 135 
pressure coefficient 19,34 
pressure drag 19 
pressure drop, calculation 117 
pressure gradient, effect 17 
prime movers 266 
principal axes 209 
principal second moments of area 
209-212 

‘printed armature’ motors 285 
probe assembly 257—258 
process diagrams (state diagrams) 49, 
52 

processes 48, 51 
irreversible 49 
reversible 49 
product gas mixtures 46 


product moment of area 207—208 
product parallel axis theorem 208 
production tolerances 135 
products, combustion 83 
proof stress 149 
propane 78—79 
propeller shaft 392 
properties, of systems 
extensive 50 
intensive 50 

proportional damping 430 
proportional— integral-derivative 
(PID) control 352-354 
psychrometric chart 61—62 
pulse width modulation (PWM) 
295 

pump factor 117 
pumps 42-45,54,263,341 

quality governed engines 133 
quantity governed engines 133 

radar antenna 345-347, 348-350, 
355-356, 359-360, 373-375 
radiant heat transfer 105 
ramp input 355 
Rankine cycle 123-124 
with superheat 124-125 
Rayleigh, Lord 439 
Rayleigh’s method 439-442 
for shafts and beams 440-441 
reactants 83 
real boundaries 48 
reciprocating compressors 90-96 
reciprocating internal combustion 
engines 131—137 
rectifiers 285-292 
recuperator 109 
referred inertia 255 
refrigerants 46, 66 
refrigeration 66—68 
regenerative heating 126 
regenerator 109 
reheaters 125-129 
relative humidity 60 
representative velocity 104 
residual stress 163 
residues 433, 436 
resistance 296 
resistors 341 
resonance 376, 379 
resonance-induced fatigue 376 
resonant frequency 379, 417 
response/unit applied force 415 
restoring force 380 
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reversible (ideal) work done 92 
reversible processes 49 
Reynolds number 4, 34, 103-104 
boundary layer 8—9 
critical 7 
living things 10 

rigid bar, axially loaded 169—170 
rigid body mode 394 
rocker system 383-384, 406-407 
rods, compressive loading 180-183 
root locus method 365-375 
applications 372—375 
mass-spring-damper system 
365-366 

rotating discs 199-206 
rotation of axes 208-209 
rotational inertia 248 
rotational load, with viscous 
characteristics 340 
rotor 292 

rotor resistance 301 
rotor standstill reactance 301 
roughness ratio 34 
Routh array 362—363 
Routh-Hurwitz criterion 362—363 

s domain 326 

block diagrams and 331-334 
S-N design procedure 230-231 
satellite isolation system 446 
saturation pressure 59 
saturation temperature 59 
screw drives 258 

second law of thermodynamics 49, 
53-54 

corollaries 53 

second moments of area 207-212 
about parallel axes 207 
second-order ordinary differential 
equation with constant 
coefficients 408 
second-order systems 

critically damped 357, 358 
overdamped 357, 358 
underdamped 357, 358 
secondary flows 32 
self-acting valves 90 
semi-perfect gases 50-51 
sensors 341 

separately excited motor 273, 
283-284 
series motors 

connected to ac supplies 273, 282 
dc 273, 278-282 

SFEE See steady flow energy equation 


shaft with added masses 438-439, 

442- 443 

shaft whirl 394-395, 438-439 
shape factor 14—15 
shear centre 184, 191, 193-194 
shear stress 185-194 

See also transverse shear stress 
distribution 
complementary 185 
shell-and-tube heat exchangers 109, 
113 

ship engine 446 
shock absorbers 405 
short-shunt motor 273, 282—283 
shrink/interference fit 202-205 
shunt motor See dc shunt motor 
similarity principle 40 
simple harmonic motion 381 
simply supported beams 398-400 
single-axle caravan 385—389, 
407-408 

single degree of freedom structures 
378-384 

See also damped single-degree-of- 
freedom systems 

single-degree-of-freedom dynamic 
models of complex systems 

443- 446 

single-phase bridge inverter 294—295 
single-point systems 136 
singular points 368 
singularity functions 152, 155 
method of See Macaulay’s 
method 

skin-friction coefficient 14 
sliding friction 259-260 
slip 293 

small perturbations 342 
small-scale yielding 234 
Soderberg line 229 
soft starter 303 
solid/ fluid boundary 107 
solids, burning 81-82 
spark ignition engines 132, 133—134 
performance assessment 134—137 
spark ignition (SI) 131 
spark timing 135—136 
specific fuel consumption (SFC) 267 
specific heat at constant pressure 50, 
69 

specific heat at constant volume 50, 
69 

specific humidity See absolute 
humidity 

specific measures 50 


specific speed 45 

specific steam consumption (SSC) 

121 

spontaneous ignition 135 
spring, transfer function 338 
spring back 163 

spring-damper system 329—330, 
335-336, 337 
square wave 295 
stability 362-364 

Nyquist criterion 363-364 
Routh-Hurwitz criterion 
362-363 

stable equilibrium 168—169 
stalling 267 
standard conditions 83 
starting characteristics, matching 
315-316 

state diagrams (process diagrams) 49, 
52 

static deflection shape 440 
static friction (‘stiction’) 259-260 
statically determinate problems 195 
statically indeterminate beams 
157-159 

statically indeterminate problems 
196 

stationary field 268 
stator 292 

steady flow energy equation (SFEE) 
48,49 

steady-state characteristics of loads 
259-265 

matching 3 1 3-3 1 5 
modifying using transmission 
265-266 

steady-state error 352, 354-356 
steady-state heat transfer 97 
steady-state operating points 311—312 
steady-state response 413 
steam engine 318 
steam plant 129 
steam power plant 47 
steam railway locomotives 313 
steam turbine 46, 129-130 
shaft 438-439 

Stefan-Boltzmann law 105—106 
step input 354 
stiffness matrix 404 
stoichiometric combustion 75—76 
strain, thermal 239 
strain energy 217—226 

in bar under tension 218-219 
beam under bending 220 
combined 225-226 
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maximum 439 
per unit volume 219 
in shaft under torsion 219 
strain energy release rate 233 
streamlining strategy 24-25 
stress 

See also plane stress; shear stress; 

thermal stress 
residual 163 

three-dimensional 147—148 
two-dimensional 147 
stress concentration, effect 230 
stress concentration factor (SCF) 230 
stress intensity factor 233, 235 
effects of finite boundaries 
235-236 

stress-strain curves 144 
uniaxial 159-160 
stress tensor 2 
stroke 91 

Strouhal number 34 
struts 168 

eccentrically loaded 176—178, 

181 

ideal 170-176 
summing junctions 343 
supercharging 136 
superheated steam 124 
superposition, principle of 140 
surface roughness 20-22 
pipes and ducts 27 
suspension bridges 395 
swamp cooler 58 
swashplate 309 
swept-sine test 433 
swept volume 91 
swing-check valve 30 
symmetric sections 208 
system, defining 47 

T-section, shear stress distribution in 
191-192 

tachogenerator 341 
Tacoma Narrows Bridge 376, 384 
tangential drives 256 
tangentially driven loads 256—259 
tanks 341-342 
TDC (top dead centre) 91 
televisions 318 
thermal boundary layer 102 
thermal capacity 103 
thermal capacity rate 110 
thermal conductivity 97 
thermal diffusivity 103 
thermal efficiency 120, 135 


thermal radiation 105 
thermal resistance 98—99 
thermal runaway 321 
thermal strain 239 
thermal stress 239-246 

initially straight uniform beam 
239-242 

thin cylinders 245-246 
thin disc of uniform thickness 
243-245 
thermometers 61 
thick cylinders 195-206 
with body forces 199-206 
compatibility 197, 200 
equilibrium 196, 199-200 
with pistons 201-202 
stress-strain relationships 197—199, 
200-201 

thin cylinders 195 
three-dimensional stress systems 
147-148 

three-term control 352-354 
threshold stress intensity factor range 
238 

throttles 54, 267 
thyristor 286 

thyristor bridge rectifier 286-287 
three-phase 289-290 
top dead centre (TDC) 91 
torque, inflence of inefficiency on 
transmission 255 
torque-slip characteristics 301 
torque-speed curves 265-266, 
299-300, 302 

torque-speed— SFC map 267 
torsional load 191 
torsional stiffness 382 
torsional systems 392—394 
total life approach 228-231 
toughness 233, 234 
train, ground-borne vibration 446 
transfer functions 323, 334-337 
closed-loop 335, 357, 366 
conversion of block diagrams to 
348-350 

open-loop 334-335, 359, 
363-364, 366 

of simple components 337-343 
transferring heat 46 
transient response 408, 412—413 
transmissibility analysis 448-451 
transmission ratio 265-266, 313 
transverse failure 145 
transverse shear stress distribution 
184-186 


asymmetric channel section 
190-191 

circular section 186—187 
I-section 187-189 
rectangular section 186 
T-section 191-192 
Tresca yield criterion 146 
tuned vibration absorbers 377, 429, 
453-456 

turbine-alternator sets 392 
turbine rotor disc 205-206 
turbines 42, 46, 54, 129-130 
shaft 438-439 
turbocharger 136 
turbomachinery 42-43 
turbulent boundary layer 10—12 
formulae for development 15 
turbulent flow 7—8 
turbulent-shear stresses 32 
two-degrees-of-freedom system 
lowest natural frequency 
estimation 437-438 
mode shape estimation 440 
steady-state response 427-429 
two-dimensional stress systems 147 
two-pole induction motor 292-294 
two-stroke cycle engine 132—133 

ultimate products 76 
undamped multi-degree-of-freedom 
structure, forced response 
426-429 

undamped natural frequency 409, 
417 

underdamped system 357, 358 
uniform beams, flexural vibration 
395-404 

uniformly distributed load (UDL) 
153-154 

unit modal mass 426 
‘universal motor’ 273 
unmixed stream 115 
unstable equilibrium 168—169 

V2500 engine 392-394 
vacuum cleaners 282 
valve geometries 29—30 
vane motor 305—306 
vapour power cycles 54, 119—130 
variable-ratio drives 252, 316 
vee belt/pulley systems 251—252 
vehicle model, 2D 389-390 
vehicles 258 

velocity boundary layer 102 
velocity profile 10-12 
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velocity scale 4, 16 
venturi 136 

vibration control techniques 
446-456 
viscosity 260 
viscous damping 405 
viscous friction 260-261 
volatiles 81 
voltage 285 
volume fraction 

and mass fraction 72, 73—74 
and molar fraction 73 
volumetric analysis 71-74 
volumetric efficiency 93-94, 135 


von Mises yield criterion 146 
vortex shedding 34—35, 455 

wall roughness 12—13 

water, state diagram 56 

water horsepower 42 

Watt, James 318 

wavenumber 396 

web, transverse shear in 187—188 

Weber number 34, 36 

wet bulb thermometer 61 

wet products 78 

wheel-hop mode 389 

wide open throttle (WOT) 133, 267 


windage 261-262 

woodworking tools 282 

work 51-52 

work ratio 120 

work transfer, gas mixture 74 

working fluid 46 

worm gears 264 

yield criteria 144-149 
Youngs Modulus 144 

z domain 326 
zeros 366, 369 
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